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THE  PROBLEM 


Develop  new  radar  techniques  based  upon  statistical  meth¬ 
ods  which  are  applicable  to  automatic  radar  systems  with  in¬ 
creased  detection  capabilities.  The  specific  phase  reported  here 
is  a  review  and  evaluation  of  some  of  the  distribution-free  statisti¬ 
cal  techniques  which  can  possibly  be  used  in  signal  detection  and 
CCM  applications. 


RESULTS 

The  use  of  distribution-free  detectors  is  quite  feasible 

since: 

1.  Several  of  the  distribution-free  detectors  are  uniformly 
better  than  the  parametric  detectors  for  certain  classes  of 
Gaussian  noise. 

2.  Several  simpler  distribution-free  detectors  are  almost  as 
efficient  as  their  parametric  counterparts. 


RECOMMENDATIONS 


1.  Initiate  procedures  to  implement  the  design  and  construc¬ 
tion  of  one  or  more  of  the  better  distribution-free 
detectors. 

2.  Investigate  better  comparison  for  the  various  types  of 
distribution-free  detectors. 


3.  Initiate  simulated  tactical  tests  of  distribution-free  detec¬ 
tors  as  soon  as  possible. 

4.  Review  current  research  in  distribution-free  statistics  for 
additional  applications  to  distribution-free  detectors. 
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I.  INTRODUCTION 

A.  GENERAL 


Signal  detection,  as  employed  in  automatic  radar  (or  sonar) 
systems,  is  treated  as  a  problem  of  testing  statistical  hypotheses. 
In  its  simplest  form,  this  reduces  to  a  choice  between  signal  or  no 
signal.  The  choice  is  inferred  from  the  data  collected  by  the  ra¬ 
dar  (or  sonar)  system  through  the  application  of  a  decision  rule  or 
process.  When  the  statistical  nature  of  the  data  is  known  for  the 
signal  and  no-signal  cases,  then  a  conventional  procedure,  such  as 
the  Neyman-Pearson  test  or  the  sequential  probability  ratio  test, 
may  be  used. 

When  the  noise  or  signal-plus-noise  distribution  is  unknown, 
the  afore -mentioned  detection  methods  present  two  major  difficul¬ 
ties.  If  the  underlying  statistics  are  known  to  be  unknown,  then 
the  detection  process  fails  to  materialize;  if  they  are  incorrectly 
assumed,  the  consequent  detector  performance  can  be  absurd. 

Distribution-free  methods  of  signal  detection  can  be  em¬ 
ployed  whenever  the  underlying  signal  and  noise  distributions  are 
unknown.  This  can  occur,  for  instance,  in  a  jamming  or  counter¬ 
measures  environment,  or  possibly  in  less  hostile  circumstances 
where  the  lack  of  physical  knowledge,  or  where  a  priori  knowl¬ 
edge  of  physical  considerations  indicates  uncertainties  in  data 
statistics  in  either  a  temporal  or  spatial  sense. 


The  study  reported  here  involved  a  number  of  distribution- 
free  tests  which  offer  some  promise  of  improved  performance 
under  circumstances  such  as  those  previously  discussed.  Informa¬ 
tion  related  to  the  required  data  processing,  distribution-free  de¬ 
tector  characteristics,  and  comparison  of  distribution-free  detec¬ 
tors  and  classical  detectors  is  presented.  The  physical  or  tactical 
considerations  related  to  the  choice  of  a  distribution-free  detector 
for  any  particular  application  is  not  discussed  and  must  be  con¬ 
sidered  at  some  later  time.  Indeed,  the  relation  is  strengthened 
and  generalized  by  the  avoidance  of  particular  and  restrictive 
applications.  Specific  radar  systems  are  not  considered  but, 
instead,  a  general  approach  is  used. 

The  general  approach  consists  of  treating  the  radar  target 
and  other  factors  external  to  the  radar  system  proper  as  a 
"transmission  medium"  through  which  the  desired  signal  is  not 
only  received  but  is  obscured  by  noise.  The  signal  might  result 
from  the  target's  reflection  of  radar  energy  derived  from  the  elec¬ 
tromagnetic  waves  from  the  radar  transmitter,  or  the  target  itself 
might  be  the  primary  energy  source.  The  noise  might  be  additive 
such  as  atmospheric  noise  or  artificial  interference,  or  might  con¬ 
sist  of  additive  as  well  as  signal -perturbing  components.  Three 
models  are  presented. 

Detection  is  accomplished  by  means  of  a  device  which  re¬ 
ceives  known  pure  noise,  possible  signal  (PS)  data,  and/or  arti¬ 
ficial  noise,  and  from  these  inputs  "decides"  (YES  or  NO)  whether 
or  not  a  signal  is  present.  In  making  this  decision,  the  detector 
is  capable  of  producing  two  types  of  erroneous  outputs:  (1)  false 
alarm  (FA),  if  NO  signal  is  present  and  it  decides  YES;  and  (2) 
false  dismissal  (FD)  if  YES  a  signal  is  present  and  it  decides  NO. 

The  probability  that  the  detector  will  produce  a  false  alarm 
(PFA)  will  be  designated  a;  /3  will  denote  the  probability  that  the 
detector  will  produce  a  false  dismissal  (PFD). 

The  perfect  detector,  which  would  produce  a  =  0  and  j8  =  0, 
has  not  yet  been  constructed.  However,  there  is  considerable 


OBSERVATION 

VALUE 


interest  in  the  so-called  ideal  detector,  which  minimizes  the  prob¬ 
ability  of  an  error,  either  FA  or  FD 

Pi 3  +  (1-P)  Oi 

where  p  is  the  a  priori  probability  that  a  signal  is  present.  To 
construct  the  ideal  detector  it  would  be  necessary  to  know  the 
probability  p. 

Interest  will  be  directed  here  to  the  distribution-free  detec¬ 
tor  which  in  a  subsequent  section  will  be  defined  and  compared  with 
the  uniformly  minimum  PFD  detector  which,  for  fixed  PFA  a, 
minimizes  PFD  j3.  It  is  first  necessary  to  consider  some  of  the 
statistical  properties  of  pure  noise  and  of  noise -plus -signal. 


B.  STATISTICAL  PROPERTIES  OF  NOISE  AND  SIGNAL 


Throughout  it  is  assumed  that 
the  PS  data  received  by  the  de¬ 
tector  consist  of  discrete  ob¬ 
servations  Xj ,  X2  ,  ’  ’  • ,  Xn  of  a 
continuous  time  process  at 
times  tj,  t2,  *  ’  ’ »  tn ;  and  that 
the  X's  are  measurements  of 
voltage,  current,  power,  etc. 
Further,  the  following  restric¬ 
tions  will  be  imposed  on  the 
succeeding  analysis. 


1.  Xx ,  X2  ,  *  •  * ,  Xn  are  statistically  independent  random 
variables; 

2.  Xlf  X2,’”,  Xn  have  a  common  strictly  increasing  continu¬ 
ous  cumulative  probability  function  (cpf)  F  such  that  F(x)  = 

P(Xi^x). 

(It  is  clear  from  the  definitions  that  a  cpf  F  is  mono¬ 
tone  non-decreasing  so  that  F(-°°)  =  0  and  F(  +00)  =  1. ) 
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3.  If  the  PS  data  are  pure  noise,  then  F  -  FQ  ;  if  the  PS  data 

are  noise-plus-signal,  then  F  =  F1  ^  FQ . 

The  first  two  assumptions  are  equivalent  to  the  statement 
that  Jj ,  •  •  • ,  Xn  constitute  a  random  sample  from  a  population  with 
cpf  F.  More  specifically,  assumption  2  states  that  the  observed 
process  is  stationary  or  time -invariant.  Under  certain  circum¬ 
stances  this  assumption  may  not  hold  for  long  periods  of  time,  but 
should  be  approximately  valid  for  relatively  short  periods. 

Assumption  1,  the  assumption  of  independence,  which  is 
valid  in  some  situations,  also  represents  an  approximation  in  many 
real  situations.  The  approximation  will  be  quite  good  if,  for 
example,  the  continuous  time  process  is  a  normal  or  Gaussian 
process  in  which  the  covariance  function  decreases  to  zero  rapidly 
relative  to  the  time  intervals  (  t  -  t  _i)  between  observations. 

Assumption  3  is,  of  course,  essential  to  the  whole  concept 
of  statistical  methods  in  detection.  If  the  statistical  properties  of 
noise  and  noise-plus-signal  are  not  different,  then  it  will  not  be 
possible  to  design  a  detector  using  statistical  procedures  except 
with  the  undesirable  property  that  a  +  /3  =  1,  i.  e.  ,  with  (PFA) 

=  l-(PFD). 

As  previously  mentioned,  the  detector  will  have  as  inputs 
under  certain  circumstances  not  only  PS  data,  but  also  pure  noise 
as  well  as  artificial  noise.  The  pure-noise  observations  will  be 
represented  as  71,  Y2  ,  •  •  • ,  Y~L  and  will  also  satisfy  assumptions  1 
and  2  and,  of  course,  assumption  3  with  F  =  F0  .  Further,  when¬ 
ever  PS  data  and  pure  noise  are  simultaneously  available,  they  will 
be  assumed  statistically  independent. 

In  order  to  facilitate  computations  or  to  achieve  certain 
exact  PFA  values  a,  it  will  sometimes  be  feasible  to  introduce 
artificial  noise  observations  ,  "2  ,  •  •  • ,  W-j  .  These  random  vari¬ 
ables  will  be  assumed  to  satisfy  assumptions  1  and  2  and  to  have 
some  continuous  cpf  H,  which  is,  in  general,  not  equal  to  F0  nor 
to  J1] . 


With  these  preliminary  statistical  considerations  it  is  now 
possible  to  introduce  the  three  major  statistical  models  of  detectors 
to  be  considered  in  subsequent  sections. 


C.  THREE  DISTRIBUTION-FREE  DETECTOR  MODELS 


Before  proceeding  to  a  study  of  distribution-free  models 
one  should  point  out  that  in  the  event  that  both  F0  and  F\  are  known, 
and  PS  data  Xlt  •  •  • ,  Xn  are  available,  it  is  possible  from  well- 
known  statistical  considerations  to  design  a  uniformly  minimum 
PFD  detector,  i.  e. ,  one  which  for  fixed  PFA  a  minimizes  PFD  (3  . 
Of  course,  such  a  detector  would  not  in  general  have  this  minimiz¬ 
ing  property  for  other  F0's  and  F1  's  and,  hence,  would  not  be 
practicable  for  use  except  when  the  noise  and  noise-plus -signal 
cpf's  are  approximately  F0  and  F1  ,  respectively. 

Detectors  whose  performance  (PFA,  etc.)  does  not  depend 
on  a  specific  fixed  continuous  F0  will  be  called  distribution-free 
detectors.  The  models  of  distribution-free  detectors  to  be  studied 
here  are  based  on  three  reasonable  types  of  statistical  situations: 

MODEL  I:  The  pure-noise  cpf  F0  is  known  and  there  are  avail¬ 
able  PS  data  X1}  X2,  •  •  • ,  Xn  whose  cpf,  >  is 
unknown. 

MODEL  II:  Both  the  cpf's  F0  and  F1  are  unknown,  but  there  are 
available  PS  data  sample  Xlt  X2,  •  •  • ,  Xn  and  pure- 
noise  sample  Ylt  •  •  •  ,  Ym  .  (This  model  was  primarily 
developed  by  J.  Capon;  see  reference  1  in  list  at  end 
of  report. ) 

MODEL  HI:  F0 ,  F1  and  a  pure  noise  sample  are  not  available,  but 
the  scanned  regions  can  be  divided  into  subregions 
and  each  PS  data  observation  can  be  classified  accord¬ 
ing  to  its  subregion  of  origin  as  follows: 

1st  Subregion:  Xn  ,  X12,---, 

2nd  Subregion:  X21  ,  L22>"'’>  ^2,r:9 


kth  Subregion:  Xkl  ,  Xk2,--~, 
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Fn(Z) 


In  making  decisions  on  the 
basis  of  Models  I,  II,  and  m, 
it  is  customary  to  introduce  the 
well-known  statistical  concept 
of  sample  cpf.  For  the  obser¬ 
vations  Xlt  X2,  •  •  • ,  Xn  ,  the 
sample  cpf  Fn  is  a  step  function 
such  that  Fn(s)  =  A  (number  of 
X's  —  s)  for  all  3 ,  as  illus¬ 
trated  here  for  n  ~  4. 


Somple  cpf  for  four  observotions . 


In  terms  of  the  sample  cpf's  the  basic  approaches  of  the 
three  models  are  as  follows. 


1.  A  Model  I  detector  compares  the  PS  sample  cpf  Fn  with  the 
pure-noise  cpf  F0  and  decides  YES  if  and  only  if  Fn  is  "too 
far"  from  FQ.  Essentially,  each  different  Model  I  detector 
uses  a  different  definition  of  "too  far. "  These  ideas  are  de¬ 
veloped  more  fully  in  Section  II. 


2.  A  Model  II  detector  compares  the  sample  cpf  Fn  of  the  PS 
data  Jj,  •  •  • ,  Xn  with  the  sample  cpf  Fm  of  the  pure-noise 
observations  Y\,  T2,  •  •  • ,  Ym  ,  and  employs  extensions  of  the 
ideas  mentioned  above  for  Model  I.  In  Section  m  this  is 
developed  in  more  detail. 


3.  A  Model  in  detector  compares  the  sample  cpf's  Fni, 

Fn2  ,  •  •  • ,  Fny  of  the  observations  from  the  k  subregions. 
This  model  is  treated  in  more  detail  in  Section  IV. 


Before  discussing  the  three  models  in  detail  it  seems  worth¬ 
while  to  evolve  a  formula  for  producing  distribution-free  detectors. 
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D.  CHARACTERIZATION  OF  A  USEFUL  CLASS  OF  DETECTORS 


Paralleling  known  statistical  techniques,  one  can  character¬ 
ize  the  class  of  Model  I  detectors  by  imposing  a  natural  restriction 
on  the  PFD  /3.  Note  first  that  if  both  the  pure-noise  cpf  F0  and  the 
noise -plus -signal  cpf  F1  are  normal  (or  Gaussian)  cpf's,  then  it  is 
known  (see  Section  II  D)  that  for  each  PFA  ol,  p  =  j3  ( F0  ,  h  )  depends 
only  on  the  function  F0(F{'1),  i.  e. ,  the  composition  of  F0  and  the 
inverse,  Ff1 ,  of  F1.  Extending  this  idea  to  the  distribution-free 
case  one  makes  the  following  definition. 

Definition  1.  A  detector  is  called  SDF  (strongly  distribution- 
free)  if  for  each  PFA  a,  each  pure-noise  cpf  F0,  and  each  noise- 
plus-signal  cpf  Fu  the  PFD,  /3  =  /3  (F0 ,  F1  ),  depends  only  on  the 
function  F0( F^1)  for  all  strictly  monotone  continuous  cpf's  F0  and 
F1  .  This  would  mean,  for  example,  that  an  SDF  detector  would 
for  each  fixed  PFA  a  have  the  same  PFD  (3  in  both  of  the  cases 
illustrated.  This  so  since  in  both  of  these  cases  Fx  =  F02 and 
F0|lr1-1(u)j  =  for  all  u. 


Further  one  needs 


Definition  2.  A  Model  I  detector  is  symmetric  if  its  deci¬ 
sions  do  not  depend  on  the  chronological  order  of  the  Xlt  ■  •  • ,  Xn  . 
(Of  course,  such  a  detector  is  not  sequential.) 

From  the  statistical  results  of  Birnbaum  and  Rubin2  and 
those  of  Bell3,4  it  is  known  that 

Theorem  1.  An  SDF  symmetric  Model  I  detector  must  base 
its  YES-NO  decisions  on  a  statistic  of  the  form  ^[/o(^i)»  •  •  •  , 

where  tp  is  a  symmetric  function  and  F0  is  the  known  con¬ 
tinuous  pure-noise  cpf. 

Table  1  gives  some  examples  of  SDF  symmetric  Model  I 
detectors,  with  J(l)  <  X(2)  <  ••  •  <  X(n)  representing  the  ordered 
values  of  Xl5  X2, '  ‘  ‘ .  Xn. 


Table  1.  Some  SDF  Symmetric  Model  I  Detectors 


Name  of 

Detector 

Decision  YES 
iff  (if  and  only  if) 

Kolmogorov  Dn 

max  max 

_ i- 

Fr(x(i))~,  £-fn(*(i))]  >d ,(n 

Fisher  v 

-2  2  In  F0(X a)  <  d2(n  ,  a  ) 

U 

2  0(X  )  <  d^(n ,  a) 

Sign  Q'p 

o_1(p))  >  d4(n»  a,p  ) 

Sherman  S 

2  F0(x(i)) 

-F0(j(i-1))-  “  >  d5(n,  a) 

For  Model  II  detectors  it  is  feasible  also  to  impose  the  SDF 
condition,  i.  e. ,  that  (3  ( F0  ,  Fl  )  depend  only  on  F0(F^1 ).  However, 
in  giving  the  characterization  of  Model  II  detectors  one  must  recall 
the  concept  of  ranks  R{Xx),  •  •  • ,  R(Xn);  /?(73),  •  ■  • ,  R(Ym)  in  the 
combined  sample. 


Example  J.  If  the  pure-noise  observations  are  7X  =  2.1, 

Y2  =  3.  2,  73  =  0.  9,  and  the  PS  data  are  Xi  =  0.  8,  X2  -2.5,  X3  -  1.  6, 
I4  =  3. 1,  then  one  has 

1.  as  order  statistics  for  the  7-sample  7(1)  =  0.9,  7(2)  =  2.1, 
7(3)  =  3.2; 

2.  as  order  statistics  for  the  7-sample  7(1)  =  0.8,  7(2)  =  1.6, 
7(3)  =2.5,  7(4)  =  3.1; 

3.  as  the  order  in  the  combined  sample  7777777  ;  and, 
hence, 

4.  that  the  7's  have  ranks  1,  3,  5,  and  6  in  the  combined 
sample,  while  the  7's  occupy  ranks  2,  4,  and  7.  More 
specifically,  R  (7j)  =  1,  R( 72)=5,  R{ 73)  =  3,  #(74)  =6, 
R(Y1)=  4,  R  (72)  =  7,  R(Y3)  =2. 

Further  one  needs  the  definitions: 

Definition  3 ■  A  Model  II  detector  is  a  rank  detector  if  its 
decisions  depend  only  on  the  ranks  of  the  7's  (and/or  7's;  and  not 
on  their  numerical  values). 

Definition  4.  A  Model  II  detector  is  SWS  (samplewise 
symmetric)  if  its  decisions  depend  neither  on  the  chronological 
order  of  7j,  •  •  • ,  Xn  nor  on  the  chronological  order  of  7l,  *  *  •  .  Ym  • 

It  is  known  from  statistical  considerations3,5  that  each  rank 
detector  is  an  SDF,  SWS  detector.  However,  one  needs  the  con¬ 
verse  of  this  result,  and  to  that  end  must  introduce  an  additional 
restriction.  This  mathematical  condition  first  considered  by 
Scheffe5’6  is  satisfied  by  all  of  the  Model  n  detectors  known  to  the 
author;  but  the  full  significance  of  the  condition  is  not  evident  to 
the  author. 

Definition  5-  A  Model  II  detector  is  a  Scheffe  detector  if  its 
decisions  are  based  on  a  statistic  T  =  T( 7j,  •  •  • ,  Xn  ;  7j ,  *  *  * ,  Ym ) 
such  that  the  boundary  of  the  inverse  image  T~l(B)  of  each  Borel 
set  B  has  probability  0  with  respect  to  each  continuous  (power) 


measure. 


This  Scheffe  condition  is  quite  complex,  and  need  not  con¬ 
cern  the  reader  other  than  as  a  condition  to  guarantee  the  following 
theorem. 


Theorem  2.  An  SDF,  SWS  Model  II  Scheffe  detector  must 
be  a  rank  detector,  i.  e. ,  must  base  its  YES-NO  decisions  solely 
on  the  ranks  of  the  Xlz  •  •  •  ,  Xn  and  Y\,  •  •  • ,  Ym  in  the  combined 
sample. 


Typical  examples  of  Model  II  rank  detectors  are  given  in 

table  2. 


Table  2. 


Name 

Decision:  YES  iff 

Cramer-von 

Mises 

J 

Fn(s)  ~  Fm(s)  2  d 

nFn(s)  +  mFm(s) 

>  d\ 

n  +  m 

Mann- 

Whitney- 

Wilcoxon 

^  R(Xi)  >  d2 

For  a  Model  HI  detector  one  recalls  that  there  are  available 
PS  data  from  each  of  k  subregions; 


Xn , 

A12’  ”  ’  > 

*l.n, 

X21  , 

^22 ,  '  '  *  , 

A2,n2 

Xkl’  Xk2>  "  '  ’  ^k,nk 

Model  m  detectors  can  be  characterized  (as  can  Model  II) 
by  requiring  the  SDF,  SWS,  and  Scheffe  condition. 
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Theorem  3 ■  An  SDF,  SWS  Model  III  Seheffe  detector  must 
be  a  rank  detector,  i.e. ,  must  base  its  YES-NO  decisions  solely 
on  the  ranks  •  •  • ,  R{Xk  _  )  of  the  samples  from  the  k  sub- 

regions. 

Typical  examples  of  Model  in  rank  detectors  are  given  in 

table  3. 


Table  3.  Some  Model  III  Rank  Detectors 


Name 

Decision: 

YES;  iff 

Kolmogorov 

k 

max  2  rii 

i= 1 

Fri.(x)  -F(x)j  >  d1 

where  F  = 

?rii  and  N  =  X,ni 

Kruskal  - 

Wallis 

12  y  R%i 

-  3  (N+ 1)  >  d2 

N(N+1)  nt 

where  Ri  -X,R(Xi  /) 

J 

Mosteller- 

Tukey 

Length  of  last  run  >  d3 

Before  summarizing  this  section  on  characterization  one 
should  make  two  relevant  comments. 

1.  Whereas  the  rank  detectors  correspond  to  a  large  class  of 
2-sample  and  fr-sample  statistics  in  common  usage,  one 
important  class  of  detectors  has  been  omitted.  These  are 
the  Pitman  type,  which  are  not  rank  detectors  as  defined 
here. * 

2.  In  subsequent  sections  artificial  noise  will  sometimes  be 
introduced  into  the  Model  II  and  Model  HI  detectors  in  order 
to  achieve  previously  stated  aims.  For  Model  II,  the  in¬ 
dividual  PS  data  observations  Xv  •  •  • ,  Xn  and  the  individual 


*  Their  statistics  are  discussed  in  ref.  7  and  in  ref.  8,  p.  489. 
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pure-noise  observations  Ti,  •  •  • ,  Ym  will  be  replaced  by  that  one  of 
the  artificial  noise  observations  Wlt  W2,  —  ,  WN  (N=  n  +m)  with  the 
same  rank.  For  example,  if  X2  has  rank  5  in  the  combined  X-Y 
sample  then  it  will  be  replaced  by  that  ¥$  iff  W§  has  rank  5  in  the  ¥ 
sample.  Thus,  the  rank  ordering  of  the  transformed  observations 
is  exactly  that  of  the  original  observations.  Consequently,  a  de¬ 
tector  which  bases  its  decisions  on  the  W's  is  also  a  rank  detector. 
Analogous  statements  are  also  valid  for  Model  III  detectors.  These 
ideas  are  treated  in  more  detail  in  Sections  III  and  IV. 

Table  4  summarizes  the  results  of  this  section. 


Table  4.  Summary  for  Model  I,  II,  and  III  Detectors 


Model  I 

Model  II 

Model  III 

Cpf's 

known 

Pure-noise 
cpf  F0 

None 

None 

Data 

available 

PS  data: 
xi,  •  •  • ,  Xn 

PS  data: 

Xi,’",  *n 

Pure-noise  data: 
Yl,  *  *  * ,  7m 

Regional  PS  data: 
^11.  X\2,  '  ’  *  ,  4'i,n 

jV21>  -!22>  ’  ’  ’  > 

Yk  1>  Xk2  •  •  • ,  Xk,n 

Character¬ 

ization 

assumptions 

SDF,  symmetric 

SDF,  symmetric 
Scheffe 

SDF,  symmetric 
Scheffe/ 

Structure  of 

detector 

statistic 

^ l[  0  (  i)>  *  *  '  > 

?  (  • 

O'/ 

R(7m) 

^3]  (  ii)>  * 1  * » 

Since  the  number  of  statistics  having  the  structures  indi¬ 
cated  is  infinite  for  each  model  above,  some  method  of  narrowing 
the  classes  of  possibilities  should  be  devised.  This  is  treated  in 
the  next  section. 
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E.  GOODNESS  CRITERIA  FOR  DETECTORS 


Since  the  number  of  possible  detectors  is  infinite  it  would 
seem  reasonable  that  not  all  of  them  are  equally  "good.  "  Conse¬ 
quently,  many  of  these  possible  detectors  can  be  eliminated  from 
consideration  by  the  imposition  of  reasonable  goodness  criteria. 

In  other  words,  consideration  will  be  restricted  here  to  those  de¬ 
tectors  which  satisfy  one  or  more  practical,  reasonable  goodness 
criteria. 

Since  a  detector  is  essentially  a  decision-maker,  the  ulti¬ 
mate  objective  is  the  design  of  a  detector  which  minimizes  loss  in 
some  appropriate  sense.  The  losses  in  the  given  models  are,  of 
course,  of  two  possible  types: 

L(FD),  the  loss  incurred  by  an  FD  (false  dismissal),  i.e. , 

by  deciding  NO  when  there  is  signal  present;  and 

L(FA),  the  loss  incurred  by  an  FA  (false  alarm),  i.e.  ,  by 

deciding  YES  where  there  is  only  pure  noise. 

If  the  detection  process  is  to  be  repeated  in  a  large  number 
of  locations  or  for  a  large  number  of  times  in  a  given  location, 
then  (as  is  assumed  throughout  this  work)  statistical  considerations 
such  as  expected  loss  or  risk,  PFA  a,  PFD  |3,  etc.  ,  are  important, 
and  some  consideration  of 

p,  the  2  pr  i  or  i  probability  of  noise-plus -signal,  is 

necessary. 

As  previously  mentioned  the  most  desirable  detector  would 

be  the 

1.  perfect  detector,  for  which  a  =0  and  /3  =  0 . 

Of  course,  such  a  detector  rarely,  if  ever,  exists.  How¬ 
ever  it  is  possible  to  achieve  a  =0  by  always  deciding  NO;  or  j3  =  0 
by  always  deciding  YES.  These  are 


2.  trivial  detectors. 


A  more  practical  detector  can  be  obtained  for  large  classes 
of  pure  noise  and  noise-plus-signal  cpf's  if  p,  L(FD)  and  L(FA) 
are  known.  Such  a  detector  is  a 

3.  Bayesian  detector,  which  minimizes  the  risk 

p/3  L(FD)  +  (1  -p)  q;  L(FA). 

In  the  event  that  the  loss  function  L  is  not  known  or  if 
L(FD)  =  L(FA),  one  may  settle  for  an 

4.  ideal  detector,  i.e.,  one  which  minimizes  p/3  +  (1  - p) a. 

In  many  practical  cases,  of  course,  neither  p  nor  the 
losses  L(FA)  and  L(FD)  are  known,  and  one  is  therefore  led  to 
criteria  based  solely  on  PFAa  and  PFD/3.  From  one  point  of  view 
the  "best"  such  detector  is  the 

5.  uniformly  minimum  (UM)  PFD  detector,  which,  for  fixed  a, 
gives  minimum  PFD/3foreach  noise-plus -signal  cpf  F1. 

It  is  well  known  from  the  Neyman-Pearson  result  that  for 
each  fixed  pure-noise  cpf  F0  and  noise-plus-signal  cpf  Fx ,  there 
exists  a  detector  which  for  fixed  ol  minimizes  /3.  However,  in 
general,  no  one  detector  will  minimize  /3  for  a  "large"  class  of 
cpf's  F0  and  F1  ;  and  a  uniformly  minimum  PFD  detector  will  exist 
only  in  special  cases.  Hence,  one  needs  other  reasonable  goodness 
criteria. 

A  detector  D  is  called  an 

6.  admissible  detector  if  there  is  no  other  detector  with  uni¬ 
formly  smaller  /3,  i.e. ,  if  for  each  other  detector  B2  there 
is  some  pure-noisc  cpf  F2  such  that  (3(Di,  F2  )  —  P(D2,F2  ); 
an 

7.  unbiased  detector  if  a  +  /3  ^1  for  all  pure-noise  cpf's  F0, 
i.e.,  if  P(YES|  pure  noise)  ^  P(YES  |  noise-plus-signal); 
and  a 


8.  consistent  detector  if  for  each  fixed  PFAa,  lim/3  =  0  as  the 

sample  size(s)  increase(s)  without  bound. 

Criteria  6,  7,  and  8  are  quite  reasonable.  If  one  detector 
has  smaller  /3  than  a  second  detector  under  all  circumstances,  then 
one  would  prefer  the  first  detector.  Hence,  admissibility  is  a 
reasonable  property  to  require  of  detectors. 

Also,  it  seems  reasonable  that  one  would  want  to  restrict 
consideration  to  detectors  which,  for  fixed  a,  have  increasingly 
small  PFD (3  as  the  quantity  of  data  increases. 

Finally,  as  is  the  opinion  of  many  who  engage  in  applications 
of  statistics,  it  seems  desirable  to  have  the  detector  decide  YES 
more  often  when  there  is  signal  than  when  there  is  only  pure  noise. 
Such  a  detector  is  unbiased. 

Up  to  this  point  it  has  been  seen  that 

a.  perfect  detectors  rarely,  if  ever,  exist; 

b.  trivial  detectors  can  always  be  constructed; 

c.  Bayesian  and  ideal  detectors  require  specialized 
a  pr  i  or  i  knowledge;  and 

d.  uniformly  minimum  PFD  detectors  exist  only  in  special 
cases. 

Further,  it  can  be  shown  that  the  requirements  of  unbiased¬ 
ness,  admissibility,  and  consistency  neither  reduce  the  class  of 
possible  detectors  to  a  "reasonable"  number  nor  provide  criteria 
for  comparison.  Consequently,  one  is  not  in  a  position  to  construct 
a  "best"  detector  at  this  time;  nor  can  one  on  the  basis  of  the  pre¬ 
ceding  criteria  make  statements  of  the  form  "detector  D1  is  50  per 
cent  better  than  detector  D2.  "  In  an  attempt  to  satisfy  this  need  one 
introduces  the  following  two  concepts: 

9.  Detector  D\  is  max-min-/3  better  than  detector  D2  for  pure- 

noise  cpf  class  oV  =  \Fa  |  and  noise-plus -signal  cpf  class 

2  =  {Fj  if,  for  each  F0  in  tAI, 


sup/3(Di)  =s  sup  P(D2) 


and 


inf  0(Dj)  ^  inf  (3(p2) 

where  the  suprema  and  infima  are  taken  over  the  class  S  of  noise- 
plus-signal  cpf's. 

The  rationale  for  this  criterion  (essentially  conceived  by 
Chapman,9  and  used  by  Bell,  Moser,  and  Thompson10  )  is  as  follows. 
If  detector  D i's  best  performance  is  better  than  detector  D2's  best 
performance  for  specified  reasonable  classes  <_A1  =  jf0)  and 
5  =  1^}  and  if  detector  s  worst  performance  is  better  than  de¬ 
tector  D2's  worst  performance  for  these  classes,  one  says  that  de¬ 
tector  Di  is  "better"  than  detector  D2  (for  these  classes).  Of 
course,  this  rating  of  detectors  depends  explicitly  on  the  classes 
chosen;  and,  further,  one  suspects  that  only  in  special  cases  will 
both  sup  /3(Dj)  ^  sup  (3(D2)  and  inf  /?(/)])  —  inf  (3(D2). 


The  second  of  these  two  concepts  is  that  of  asymptotic  rela¬ 
tive  efficiency.  Any  detector  of  "feasible"  design  can  be  expected 
to  have  good  performance  when  there  is  a  strong  signal.  Conse¬ 
quently,  it  seems  reasonable  to  compare  two  detectors  on  the  basis 
of  their  relative  performances  in  the  presence  of  increasingly  weak 
signals. 


10. 


The  asymptotic  relative  efficiency  (ARE),  A  (Dj,  D2)  of  de¬ 
tector  D1  with  respect  to  detector  D2,  is  defined  to  be  the 


limit,  lim 


rc(A>) 

n(9x) 


,  of  the  ratio  of  their  sample  sizes  as 


fixed  a  and  (3  are  maintained  as  the  signal  strength  tends 
to  zero,  i.  e. ,  as  F1  tends  to  F0  . 


This  statistical  concept  (often  attributed  to  Pitman7)  has 
been  recently  treated  by  a  large  number  of  authors.11-18  In  order 
for  the  limit  to  exist,  be  unique,  and  be  independent  of  a  and  (3,  a 
large  number  of  regularity  conditions  are  needed.  (These  are  men¬ 
tioned  in  Section  II  D.)  Further,  although  one  can  make  statements 
of  the  form  "D\  is  50  per  cent  better  than  D2"  on  the  basis  of  ARE, 
the  ARE  depends  quite  heavily  on  the  pu re-noise  cpf  F0  and  the 
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manner  in  which  the  signal  tends  to  zero,  i.  e. ,  the  manner  in 
which  the  noise-plus-signal  cpf  F1  tends  to  F0- 

Consequently,  in  order  to  apply  criterion  9  and  rank  a  given 
set  of  detectors,  or  to  apply  10  and  assign  a  number  to  each  detec¬ 
tor,  one  must  specify  classes  of  pure-noise  and  noise-plus-signal 
cpf  s. 


Considerations  such  as  these  lead  one  to  consider  detectors 
whose  performances  are  the  same  for  "large"  classes  of  pure- 
noise  and  noise-plus-signal  cpf's.  Such  detectors  are  strongly 
distribution -free;  one  repeats  the  definition  of  Section  ID. 

11.  A  detector  is  called  an  SDF  detector  if  for  each  PFA  a, 
each  pure-noise  cpfFOJ  and  each  noise-plus-signal  cpf  F\, 
the  PFD,  /3  =  /?(F0,  Fi  )>  depends  only  on  the  function 

F0  ( F-[ 1 )  (for  all  strictly  monotone  continuous  cpf's  F0  and 

h  )■ 

12 

Very  recent  statistical  results  of  Bell  and  Doksum  can  be 
adapted  to  yield  two  additional  goodness  criteria  for  detectors. 

Let  \Fe\  be  a  class  of  noise-plus -signal  cpf's  such  that 
lim  F0  -  F0  =  F0 ,  the  pure-noise  cpf. 

12.  A  detector  D1  is  said  to  be  a  locally  minimum  (LM)  PFD  de¬ 
tector  for  the  class  \Fq  j  above,  if  there  exists  e  >  0  such 
that  the  PFD  /3(Z?X,  F0 )  -  P(D2,  F0  )  for  all  other  detectors 

L2  and  all  9  satisfying 

|0  -  0O  I  <  e. 

Quite  often  it  is  impossible  to  design  such  a  detector  and, 
for  a  given  class  }F0J  of  the  form  above,  one  may  settle  for  an 

13.  almost  locally  minimum  (ALM)  PFD  detector,  which  is  a 
detector  whose  YES-NO  decisions  are  based  on  a  statistic  T' 
satisfying:  for  every  positive  6  and  rj ,  there  exists  a  positive 
e  such  that  P0(|I"  -  T(0)|  >  v)  <  6  whenever  |0  -  0O|<  e, 
where,  for  each  9  ,  T (0)  is  the  statistic  of  the  uniformly 
minimum  (UM)  PFD  detector  for  pure-noise  cpf  F0  vs. 
noise-plus-signal  cpf  F0 . 


It  will  be  seen  in  Sections  II D1  and  IDE  1  that  for  SDF  de¬ 
tectors  and  families  of  noise-plus-signal  cpf's  satisfying  certain 
regularity  conditions,  one  can  develop  explicit  formulas  for  the 
LM  PFD  and  ALM  PFD  detectors. 

Before  beginning  a  detailed  study  of  the  detector  models  one 
should  mention  that  there  are  certain  practical  considerations, 
e.  g.  ,  those  of  equipment  design,  and  those  of  calculation  and  tabu¬ 
lation  of  the  appropriate  statistical  distributions,  which  lead  one  to 
consider  detectors  which  violate  some  of  the  goodness  criteria, 
i.  e. ,  detectors  which  are  less  "optimal"  and  more  "tractable.  " 

One  is  now  in  a  position  to  study  the  detector  models  in 
more  detail. 


PS  DATA 


SIGNAL 


TRANSMISSION 

^1,  •  •  *  ,F-n  ^ 

DETECTOR 

MEDIUM 

PURE-NOISE! 

NOISE 

CPF  F0 

KNOWN  j 

— O  YES  (SIGNAL  PRESENT) 
— O  NO  (SIGNAL  ABSENT) 


COMPARE  Xx,---,Xn  AND  FQ 


II.  MODEL  I  DETECTORS 


As  one  reealls  from  Section  I D,  a  Model  I  detector  is  one 
for  which  there  are  available  both  the  known  pure-noise  cpf  F0, 
and  PS  data  Xx,  X2,  •  •  • ,  Xn  .  The  decision-making  process  then 
consists  of  comparing  the  PS  data  and  F0  and  deciding  YES  iff  they 
are  in  some  sense  "too  far"  apart.  Further,  with  the  imposition 
of  the  very  reasonable  (Section  ID)  SDF  condition  one  finds  that  the 
SDF  Model  I  deteetor  must  base  its  YES-NO  decisions  on  statistics 
of  the  form  ip  [F0  ( Xx),  •  •  • ,  F0(Xr ,)]  . 

Following  the  "natural"  divisions  of  SDF  statistics  one  con¬ 
siders  three  classes  of  Model  I  detectors: 

1.  Sample  cpf  (SDF  Model  I)  detectors,  which  employ  statistics 
based  on  differences  of  the  PS  data  sample  cpf  Fn  and  the 
pure-noise  cpf  F0  ;  one  recalls  that  Fn  ( y )  =i  (number  of 
J’s  si/),  the  sample  proportion  of  X's  which  are  less  than 
or  equal  to  y. 

2.  Run -block  (SDF  Model  I)  detectors,  which  deal  with  the 
number  of  PS  data  values  Xx,  •  •  • ,  Xn  which  fall  in  certain 
preassigned  intervals  and/or  the  relative  spacing  of  these 
values;  and 

3.  Rank-sum  (SDF  Model  I)  detectors,  whose  decisions  are 
based  on  sums  of  percentile  ranks,  F0  (Jj),  of  the 

%i , *  * ‘ ,  xn  . 
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In  the  next  three  sections,  general  formulas  and  specific 
examples  of  the  common  types  of  Model  I  detectors  are  given.  In 
later  sections  the  various  detectors  are  compared  in  terms  of  the 
previously  mentioned  goodness  criteria  —  PFA  a,  PFD/3,  ARE, 
etc. 


A.  MODEL  I  SAMPLE  Cpf  DETECTORS 

The  detectors  discussed  in  this  section  are  primarily  based 
on  generalizations  and  specializations  of  three  well-known  distri¬ 
bution-free  statistics  whose  formulas  are  given  below. 


1.  Kolmogorov-Smirnov  Detector 

D(n,r,  i>)  =  sup  [  \Fn  (x)  -  Xc(.r)|  v/*  (F0(x)) 

x  1 


(1) 


The  usual  specializations  and  variations  of  this  statistic  are 
as  follows. 


Br=  SUP  [1^(3;)  -  F0(X)|] 

=  max  |  max  j  F0  ( X(i ))  ~  (*(*)) 


(2) 


where  X(l)  ^  X(2)  s  •■•  <  x(n)  are  the  ordered  values  of  the  PS 
data  Xj,  A  2  ,  •  •  • ,  Xn  ; 


The  two  one-sided  versions: 


D\  -  sup  -  X0(x)]  =  max  J-  -  F0  (X(i))j 


D~n  =  sup  [X0(x)  -  X-  (x)]=  max  F0  (X(i))  - 

i  1 


i- 1 


(3) 

(4) 
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and 


yn  ~  sup  (| Fn(x)  -  -F0(x)|  {(/0(x)]  [1  -  F0(x)]\  2) 


=  max  (J  —  -  F0(X(i))\  \F0(X{i))  [1  ~F0(X(i))]\  2J{ 5) 

which  is  designed  to  be  sensitive  to  deviations  in  the  tails  of  the 
pure-noise  cpf  F0. 


2.  Cramer-von  Mises  Detector 

W{n,r,*)=f\\Fn(x)-F0(x)\  v/^[f0(x)]|r  dF0(x)  (6) 

The  common  versions  of  this  statistic  are: 

OO 

nw\=n  f  [J„(X)  -  F0(x)f  d,F0(x) 


,  JL 

—  +  Y 

L2n  /_ 


2i-  1 


12  n  (  2  n 

i= l 


hum 


The  two  one-sided  versions 


■+n=  n  J  [^(x)  -  F0(X)]  dF0(x) 


W~n  ~  n 


J  (/o(^)  “  ^.(x)]  dF0(x) 


are  related  by  W+n  =  -W  7X  =  1 1  ~  ^  ^  l’0(Zi)j  and,  hence,  are 

both  equivalent  to  the  rank-sum  statistic  7  =-  ^  F0  (Xrf,  which 


is  treated  in  the  following  section. 


i 


GO 


i V2n=  n 


f  [Fn(x)  -  F0(;r)]2 


F0{x)  (1  -  F0(x))\ 


dF0(x) 


1  n 

=  ~n  ~~  ^  |(2i-l)  In  F0(X(i>)  +  [2(n-i) 
i=l 

+  l]  ln[l  -  F0(J(i))]|  (10) 


which  corresponds  to  D'n  above  and  is  designed  to  be  sensitive  to 
deviations  in  the  tails  of  the  pure-noise  cpf  F0. 


On  examining  the  preceding  formulas  one  sees  that  these 
statistics  are  simply  generalized  distances  between  the  PS  data 
sample  cpf  Fn  and  the  pure-noise  cpf  F0 .  The  Kolmogorov-Smirnov 
class  of  detectors  is  concerned  with  the  maximum  of  certain  func¬ 
tions  of  the  differences  Fn  -  F0  ,  while  the  Cramer-von  Mises  class 
of  detectors  base  their  decisions  on  the  integrals  of  certain  func¬ 
tions  of  Fn  -  F0  .  The  one-sided  versions  of  both  classes  of  statis¬ 
tics  are  natural  to  consider  if  the  noise-plus-signal  cpf  F±  <  F0  or 
if  >  F0. 

_  i 

One  notes  that  the  function  j  F0(x)  [1  -  {  diverges 

as  x  tends  to  ±  00 ,  i.e. ,  as  F0(x)  tends  to  0  or  1.  For  this  reason 

|#,(;r)  -  F0{x)  |  t1  “  F0(x)l  j 

will  tend  to  be  large  if  the  absolute  difference  |Fn(x)  -  F0(ar)|  is 
large  for  very  small  x  or  for  very  large  x,  i.e. ,  differences  in 
the  tails  of  F0(x)  are  magnified.  Hence,  detectors  based  on  equa¬ 
tions  5  or  10  are  sensitive  to  noise -plus-signal  cpf's  F1  which 
differ  from  the  pure-noise  cpf  Fa  in  the  tails. 
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Fix)  F(x) 


3.  Sign-Quantile  Detector 

Before  giving  numerical  examples  of  the  use  of  the  detectors 
just  described,  one  should  consider  the  simplest  sample-cpf  detec¬ 
tor,  which  is  based  on  the  sign  statistic  and  is  historically,  per¬ 
haps,  the  first  distribution-free  statistic. 

In  its  rudimentary  form  a  sign  detector  (sometimes  called 
a  threshold  detector)  is  one  which  decides  YES  if  the  number  Q'  of 
PS  data  observations  Jx,  •  •  • ,  Xn  which  fall  below  some  preassigned 
threshold  iP{F0)  =  F^\p)  is  "too  large.  ’’  That  is, 

n  n 

Q'  =  2  e  CM* o)  "^]  =  X  e  &  ~  W>] 

i=l  i=l 


where  e  is  the  degenerate  cpf  such  that 


e(z)  = 


0  if  s  <  0 
1  if  2  SO 


A  cursory  examination  of  the  situation  indicates  that  in  the 
pure-noise  situation  approximately  lOOp  per  cent  of  the  Xlt  •  •  •  ,  Xn 
will  be  equal  to  or  below  the  threshold  £.p{F0);  and,  of  course,  ap¬ 
proximately  100(1  -p)  per  cent  of  these  observations  will  exceed 
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£p(F0).  Consequently,  Q'  will  be  too  large  if 

Foible))  “  Fn(tP(F0))  =  p~  F^FoHp)) 

is  too  small,  since  nFn  (Fq\p)^  is  the  number  of  Z1,  •  •  • ,  Xn  which 
are  less  than  or  equal  to  ZP(F0)  =  Fq1(p),  the  pthquantile  of  F0  (see 
illustration,  A). 

The  sign  detector,  although  it  will  be  seen  to  have  certain 
good  properties,  does  not  provide  a  desirable  situation  for  dis¬ 
tinguishing  between  pure  noise  and  noise-plus-signal  if  i’j  '\p)  =  Fo\p). 
In  both  the  pure  noise  and  noise -plus -signal  cases,  the  J's  will  then 
fall  below  the  threshold  with  the  same  probability,  and  elementary 
calculations  with  show  that  a  +  (3  =  1,  i.  e.  PFA  +  PFD  =  1  (see 
illustration,  B). 


F(x)  A.  F(x)  B. 


This  leads  one  to  consider  de¬ 
tectors  whose  decisions  are 
based  not  on  a  single  threshold 
tP{F0)  =  Fo\p),  but  on  "thresh¬ 
olds"  £7l(J0),  Zq2(F0),--  ■, 
as  diagrammed. 
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Let  0  =  qo  <  q1  <  q2  <  •  •  •  <  qs  2  <  qr  =  1  be  an  arbitrary 
finite  sequence  of  probabilities.  Then 


(^(go).  ^(q^^Fo-Hq,),  F~\q2) 


is  a  partition  of  the  real  numbers  into  s  nonoverlapping  intervals 
on  the  real  line.  If  there  is  only  pure  noise,  i.  e.  ,  if  F  -  F0,  then 
for  the  ith  interval  Jt  =  -FtT1(gj,)  ] , 


1.  its  probability  is  PFo(Jt)  =  qt  -  qui  =  Pt 

2.  the  expected  number  of  PS  data  points  Xlt  •••,  Xn  to  fall  in 
the  interval  is  n(qi  -  qi_i)  =  npt 

3.  the  actual  number  of  PS  data  points  in  the  ith  interval 

m  =  n^Fn  (  r;\q.))  -  (%_!>)]  •»  and 

4.  each  of  quantities  |  (<?;))  -  - 

(9i  "  9j_i)  |  =  |"n  ~  p  |  is  in  some  sense  a  measure  of  the 
deviation  of  the  PS  sample  Xlt’  •  • ,  Xn  from  F0 . 


For  these  reasons  one  is  led  to  the  following  generalization 
of  the  original  sign  statistic. 

For  the  sign-quantile  detector, 


Q  (fii  ’ »  9s_i>  r ; 


where  for  i  =  1,  2,  •  •  • ,  s  , 

rn  =  n  [-% (Vo-1  (o,) )  -  Fn (To-1  ( g,_j) )  j  and  p.  =  q.  -  q._v 
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On  setting  s  =  2,  q1  =  p,  r  =  1  and  4'  =  1,  one  obtains 


a.  TWO-SIDED  SIGN  DETECTOR 


Q(n ;  p;  1;  1)  =  2 


n  l 

1  ” 

~n  -P 

=  2 

-  y  e[p- - p 

i=l 

=  f  l<2’  -"Pi  (12) 

where  <2'  and  e  are,  respectively,  the  sign  statistic  and  degenerate 
cpf  defined  above  and  given  below. 


b.  SIGN  DETECTOR 

n  n 

Q'p=  E  Wt)]  =  ^  €CM^o)  -  xt]  =  n\  (is) 

i  =  l  i= 1 

(Note:  When  p  -  0.50,  <2'50  is  sometimes  referred  to  as  the 
median  statistic.) 

The  most  common  statistic  for  8-2,  i.e. ,  for  more  than 
one  "threshold,  "  is  the  following,  which  is  sometimes  called  a 
"chi-square"  statistic  because  of  its  limiting  distribution. 


c.  EXTENDED  (MEDIAN)  DETECTOR 


«-x 

i  =  l 

Q(n;q1,"  ’ ,  qs_v 


,  \2 
(nt  ~  np.) 

npi 

r\  4 > )  for 


,  which  is  the  specialization  of 
which  r  =  2  and  ^  (u,  v)  =  \ln  /u. 


(14) 


In  the  event  that  pf  =  1/s  or,  equivalently,  qi  =  i/s,  then 

one  has 

S  S  2 

=  HI  S  (snrn)2  =f  ^  (nt-  j)  (15) 

i— 1  i=l  '  ' 

From  one  point  of  view  the  extended  median  Q  gives  undue 
weight  to  intervals  (ir0_1(gri_1),  ^o"1^)]  =  (iq^C^o).  ^(^o)]  for 
which  p  =  q.  -  g^is  small  because  of  the  denominator  npt.  In  an 
attempt  to  adjust  this  situation  one  considers  a  statistic  Q  with 
r  =  2,  and  v^(u,  v)  =(s[u  +  \[U  )" 


d.  MATUSITA  DETECTOR 


i=l 


nJn  + 


s 

nT^.)  =  4n  ^  | N/n,;/n  -  p .  |  • 

i- 1  L 


lrii/n  +  \Tp.  j  ~  8n  -  8*Jn  ^\/ni  Pi 


which  reduces  in  the  case  p,=  l/s  to 


§s  =  8n  1-  (ns) 


'Vni 


(16) 


(17) 


Of  course,  the  twelve  or  so  types  of  Model  I  sample  cpf  de¬ 
tectors  above  do  not  in  any  sense  exhaust  all  of  the  possibilities. 
However,  they  do  give  a  more  or  less  comprehensive  picture  of 
the  sample  cpf  statistics  in  current  use  and,  hence,  of  Model  I 
sample  cpf  detectors  which  should  be  considered. 

Examples  illustrating  the  computations  involved  in  the  use 
of  several  of  these  statistics  are  given  below.  Since  the  normal 
distribution  and  the  Rayleigh  distribution  are,  of  course,  two  of  the 
more  common  noise  distributions  studied  in  signal  detection,  the 
use  of  the  Kolmogorov-Smirnov  and  Cramer-von  Mises  detectors 
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will  be  illustrated  for  a  pure-noise  cpf  Fa  which  is  N(0, 1),  i.e. , 
normal  with  /u  =  0  and  a2-  1;  and  the  sign-quantile  tests  will  be  il¬ 
lustrated  for  a  pure-noise  cpf  F0  which  is  standard  Rayleigh,  i.e. , 
F0(x)  =  1  -  cxp(-x2)  for  r^O. 

Example  2.  Consider  the  following  Model  I  detection 
situation. 

Pure-noise  cpf:  Standard  Rayleigh,  Fa(x)  =  1  -exp(-.r2)(.r  s; 

PFA  a  =  .  05 

PS  data:  1 . 3,  .  3,  .  2,  1. 9,  .  8  n  =  5 

Dn  detector:  Decide  YES  iff 

max  j  max  [f(X(i))  \  ~  F0  (Z(i))]  j  >  .  5633 


nu2n  detector:  Decide  YES  iff 


1 

12n 


+ 


>  .46136 


(Note:  ".5633"  is  the  95th  percentile  of  the  Dn  distribution 
for  n-  5  obtained  from  ref.  19,  p.  431.  ".46136"  is  the  95th  per¬ 
centile  of  the  nw2n  distribution  obtained  from  A.  W.  Marshall's 
table,  ref.  20.) 


Computations 

Kolmogorov 

Cramer-von  Mises 

Basic  Computation 

-Smirnov  Computation 

Computations 

'N  Fo  {F(i))  lYlj\\  _  Lli  —  -  TP  (Yli\\ 

=  Fn(X(i ))  X(i)  1  -exp  [X2(  i)]  Fo(F(W  N  N  F0(X(i)) 


For  the  Bn  detector 

J}n  =  max  (.  04,  .16,  -.11,  .31,  .07,  .13,  .16,  .04,  .17, 
.03)  =  .31. 

Since  .  31  <  .5633,  the  Dn  detector  decides  NO. 

For  the  nco2n  detector 

nu2n  =  [(12)(5)]_1  +  (.  0036  +  .  0441  +  .  0009  +  .  0144  +  .  0049) 

=  . 0167  +  . 0679  =  . 0846  <  . 46136.  Hence,  the  ruo2n 
detector  decides  NO. 


PROB  ]Xi  ^  x 


The  approximate  value  of  Dn  can  be  obtained  from  the  graphs 
of  Fn  and  F0  in  the  following  figure,  since  Dn  is  the  maximum  of  the 
absolute  value  of  the  differences  JV(.r)  -  FQ(x).  From  the  graphs 
one  can  see  that  the  maximum  difference  is  attained  fora:  =  .3, 


and  is  Fn  ( .  3)  -  F0  ( .  3) 


4  - 


1  -  exp  [-(.  3)2]  |  =  .31. 


Rayleigh  cpf  Fa  and  sample  cpf  F„  (see  example  2,  p.  28). 


Note  also  that  from  the  table  above  the  values  of  several  of 
the  other  sample -cpf  detector  statistics  can  be  computed,  as  follows: 

From  the  sixth  column, 


D+„  =  max 


F0(X(i)) 


=  max(.16,  .31,  .13,  .04,  .03)=. 31 


From  the  fifth  column, 


B  ~n  -  max 


f0  Cr(i))  - 


i  -  1 


n 


=  max  (.  04,  -.  11,  .  07, 


.16,  .17)  =  .17 
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From  the  fourth  column, 

n 

W+n=~  ~  7  F  (X(i))  =  |  -  (.  04  +  .  09  +  .  47  +  .  82  +  .  97) 

2  i  2 

i=l 

=  2.  50  -  2.39  =  .11,  and 
W~n=  -  W+n=  -.11. 

For  an  illustration  of  the  use  of  the  sign-quantile  detectors 
one  can  consider  the  following  example. 

Example  3- 

Pure -noise  cpf:  F0  isj\r(0,l)  (See  figure  below,  in  which 
F0  and  Fn  are  plotted. ) 


N(0, 1)  cpf  Fq  and  sample  cpf  Fn  (see  example  3,  p.  31). 


♦ 


I 
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PFA  a  =  .  10 


n  =  20 


20 

A 

(a)  §'50  detector:  Decide  YES  iff  / 

e(0  -  Xt)  ^  6 

i  =1 

20 

(b)  Q 1 75  detector:  Decide  YES  iff  / 

e  (.675  -  Xi)  ^  12 

i=1 

4 

(c)  Q4  detector:  Decide  YES  iff  [(20)(4)]~1'>  (4rc  j  -  20)  2:  6.  25 

i=l 

[  1  ^ 

(d)  Q 4  detector:  Decide  YES  iff  8(20) 

1-  [(20) (4)] ~ 2  >  \JK7 

—  6.  25 

\  i= 1 

PS  data:  .06,  .40,  .09,  1.58,  -.27,  -1.15,  1.68,  -.05,  -1.55, 
1.54,  .55,  2.38,  .81,  -.46,  .32,  1.02,  -.91,  -.78, 

.50,  .11. 

Computations:  £.25(^0)  =  ~-675‘>  £50(^0)  =  0.00;  £75(F0)  =  .675 


Computations 


i  X(i ) 


1 

-1.55 

2 

-1.  15 

- 

3 

-  .91 

- 

4 

-  .  78 

- 

5 

-  .46 

- 

6 

-  .27 

- 

7 

-  .05 

8 

.06 

+  ' 

9 

.  09 

+ 

10 

.  11 

+ 

11 

.  32 

+ 

12 

.40 

+ 

13 

.50 

+ 

14 

.55 

+ 

15 

.  81 

+ 

16 

1.  02 

+ 

17 

1. 54 

+ 

18 

1.  58 

+ 

19 

1.68 

+ 

20 

2.38 

+ 

for  C.’so 

•  7  =  0' 

V.50 

4  50(^0)  =  00° 
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Computations  Computations 
for  0;75  for  O4.  C4 


n,=  4 


:| "" 

2^.25^)  =  --675 

2  l  n2=  3 


3 
3 
3 
3 
3 
3 

3 

^>=.675  -y 

4 

: 

4 
4 


0)  =  0.00 


<75(^0)  =  -  675 


n4=  6 
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(a)  For  the  $'50  detector:  <2 '50  =  7  >  6  and  the  50  detector 

decides  NO. 

(b)  For  the  Q \  75  detector:  <2!  75  =  14  >  12  and  the  <2'.  75  decides  NO. 

(c)  For  the  C4  detector:  <24  =  (80)"1  [(16-20)2  +  (12-20)2  +  (28-20)2 

+  (24-20)2  ]  =  2.  00  <  6.  25  and  the  <24 
detector  decides  NO. 

(d)  For  the  Q4  detector:  Q4  =  (160)  [l  -  (80)"  ^  +  \l~3  +  \f ~7 

=  4.  00  <  6.  25  and  the  Q4  detector  decides 
NO. 

In  closing  this  section  one  should  mention  two  pertinent 

facts: 

1.  In  none  of  the  cases  in  Example  3  (p.  31)  was  the  exact 
PFA  a  attained,  since  the  exact  distributions  of  the  statistics  in¬ 
volved  are  given  by: 

Theorem  4.  If  F  =  F0 ,  (i)  Q'  has  binomial  distribution 
with  parameters  n  and p;  and  (ii)  as  n  tends  to  infinity,  Qs  and  Q 
both  have  asymptotically  a  chi-square  distribution  with  (s-1) 
degrees  of  freedom. 

The  "6"  and  "12"  for  §'.50  and  Q\  75,  provide  a's  of  .0577 
and  .  1018,  respectively;  &  =  .  10  is  attainable  with  these  sign  sta¬ 
tistics  only  through  randomization.  Further,  for  n=  20,  the  "6.25" 
is  only  an  approximation  to  the  95th  percentile  of  Q4  and  Q4  distri¬ 
butions.  A  summary  of  the  distribution  situation  is  presented  at 
the  end  of  this  section. 

2.  Up  to  this  point  nothing  has  been  developed  which  would 
allow  one  to  objectively  express  a  preference  of  detectors  among 
those  already  considered.  This  aspect  of  the  detection  problem 
will  be  discussed  in  Section  D. 


B.  MODEL  I  RUN-BLOCK  DETECTORS 


The  detectors  of  this  type  are  based  on  the  spacings  of  the 
PS  data  relative  to  the  pure-noise  cpf  F0  . 

It  is  well-known  that 

Theorem  5.  If  0  =  qo  <  q1  <  •  •  •  <  gs_1  <  qs  =1,  and  there 
are  n  PS  data  observations, 

(i)  the  expected  number  of  observations  falling  in  the  interval  or 

cel1  (*9 »  Wo)]  =  ( ^o_1  ( _! ) * ^0-1(<7i ) ]  is 

n(qrqul)=np.  and 

(ii)  for  integers  k  <  r,  the  expected  value  of  the  difference  of 
"heights"  F0  (X(r ))  -  F0(X(k))  is  ^fj0(Z(r))  -  fo  (*<*»]  = 
r-k 

n  + 1 

Hence,  when  the  (n  +  l^.  are  integers,  then  each  of  the 
quantities  | F0  (j[(rz  +  l)  qi  ])-  [(n+1)  gi_1  ]^—  pi  j  is  in  some 

sense  a  measure  of  the  deviation  of  the  PS  data  from  F0 .  This  is 
also  true  for  the  number  E  of  empty  cells  among  the  <?■_,), 

F0 _1(<?£)  .  One  is  therefore  led  to  detectors  based  on  the  following 
statistics. 

1.  Spacing  Detector 

s 

S(n,qx,-”  ,qs_{,r,  *)  =  J  [|^o(^[(n+1)9i  ])"  iro(^[(n  +  l)gi_1 

i=l  L 

-  p.  |  'P (p,. )  j  (18) 

where  each  (n+ljg,.  is  an  integer;  0  =  q°  <  g1<  •  •  •  <  g  <  gq  =  1; 

Pi  ~  ~  <?,-_! ,  X  (0)  =  -  =o  and  X(  n+ 1)  =  «> 

The  most  common  versions  of  this  detector  are  based  on 
the  following  statistics. 
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o.  SHERMAN  DETECTOR 


n+1 

»  "  s(n;  Si1'"’  J  |?o<W»-  W<‘-D) 


b.  KIMBALL-MORAN  DETECTOR 


/  1 

1  Til  - ,  *  •  *  , 

—  ;2;0 

n+1  , 
-2 

\  ’  n+1’ 

n+1’  ’  ) 

.L-i  I 

7.  =1  L 

-i  2  n+1 

1/  — *1 


The  following  example  illustrates  the  computations  involved. 

2 

Example  4.  Pure-noise  cpf:  Fa  (x)  —  1  -  exp(-x/4)  (x  -0), 
PFA  a  =  .  01,  n  =  5 

PS  data:  1.6,  3.8,  .4,  2.6,  .6 

Sn  detector:  Decide  YES  iff  Sn  >  0.57442* 

S'n  detector:  Decide  YES  iff  S'n  >  s(5,  .01) 


*  "0.57442”  is  the  99th  percentile  of  the  Sn  distribution  for  n  =  5,  from  ref.  21,  p.  448- 
As  of  this  writing  there  ore  no  published  tables  of  the  distribution  of  S'n  ,  although  such 
tables  can  be  constructed  with  the  aid  of  high-speed  computers.  Hence,  s  (5,  .01)  is  not 
known. 


Computations 


For  Sn 

For  S'n 

F0(X(i))~  F0(X( 

~n+ 1~I>1 

Mxii))  - 

i 

X(i) 

F0(X(i)) 

F0(X(i-l))Y 

0 

—  oo 

0 

1 

0.4 

.04 

iH 

o 

o 

=  .13 

.  0016 

2 

0.6 

.09 

|. 09  -  .04  -  .17 1 

=  .12 

.0025 

3 

•  1.6 

.47 

|.47  -  .09  -  .17 1 

=  .21 

.1444 

4 

2.6 

.82 

.82  -  .47  -  .  17 1 

00 

H 

II 

.1225 

5 

3.8 

.97 

|  .97  -  .82  -  . 17 1 

=  .02 

.0225 

6 

CO 

1.00 

|l.00  -  .97  -  . 17 1 

*  .14 

.0009 

.80 

.2944 

Therefore  Sn  -  •  80  and  S'n  =  .  29  -  .  17  -  .  12 

The  3n  detector  then  decides  YES,  since  .80  >  0.57442. 


The  decision  of  the  S'n  detector  depends  on  the  unknown 
s(5,  .01). 

2.  Empty-Cell  Detector 

The  second  type  of  Model  I  run -block  detector  is  the  empty- 
cell  detector,  which,  as  its  name  suggests,  is  based  on  the  number 
of  the  cells  _1  (q  f-1  ),  F0~l  (qt  )]  which  are  empty.  (For  this 
detector  it  is  convenient  to  let  s  +  1  denote  the  number  of  cells.) 

Before  giving  the  general  formula  for  the  statistic  involved, 
one  should  note  that 

1.  71 1  =  n  [-?ti  (/o'1  (<?;))  ~  Fn  (•f’o"1  (9^! ))]  is  the  number 
of  PS  observations  in  the  £th  cell  J i  =  (Fq-1  (q  i_1), 

V<gt)|. 


2.  e[jn  (F-\gi))  -  FnfoHqt-i))- ]  = 


0  if  Jj  is  empty 
1  if  Jj  is  occupied, 


1  if  s  a  o 

where  e(s)=  ,  the  degenerate  cpf. 

0  if  s  <  0 


3.  X  e  I Fn  (Fo\<li))  ~  ^(^o_1  (%_!>)  -  ^ ]  is  the  number  of 
i=l 

occupied  cells;  and  since  s  division  points  ir0”1((?1)  <  * ' '  < 
give  (s  +  1)  cells  J> 


4.  (s 


s+1 

+1)  -  ^  e  ]  is  the 


number  of  empty  cells. 


Consequently,  one  defines  for  the  empty-cell  detector 


E  (n;  qv 


d  '  -L 

9S)  =  (s+1)  -  S  ekfar1^)) 
.*  1  * 


One  usually  chooses  the  cells  to  be  equiprobable,  i.  e. , 

q.  =  —  and  p.  =— r  ;  and  works  either  with  the  statistic 
^  s+l  s+1 


O  ■  j- 

Kr  <s+1>  -  £  * -  rn(r,-i£))  -  +  ]  P2) 

1=1  L 

or  with  the  equivalent  statistic 
s+1 

Kn  *  2  ^  [?„  (k'(±))  -  (i^O)  -  +  ]  (22) 


which  is  the  number  of  occupied  equiprobable  cells. 
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Since  the  distribution  of  E  is  the  one  usually  tabulated  (see 

A 

ref.  21,  p.  454  ff),  the  detection  example  below  will  use  E . 

Example  5. 

Pure -noise  cpf:  F0  is  standard  Rayleigh,  F0(x)  =  1  - 
exp(-.r2)  (x  2:  0). 

PFA  a  =  .  01 

PS  data:  1.66,  2.06,  3.16,  1.69,  1.33,  .45,  3.26,  1.55, 
.05,  3.14,  2.15,  3.98,  2.41,  1.14,  1.92,  2.62, 
.79,  .82,  1.71,  2.10 

A  /A 

E10  detector:  Decide  YES  iff  E10  ^  6  (i.e. ,  if  six  or  fewer 
of  the  ten  equiprobable  cells  are  occupied). 

Computations 

To  find  the  boundary  points  of  the  ten  equiprobable  intervals 
or  cells  one  must  solve 

1 

1  -  exp(-x2)  =p  or  x  =  [-ln(l-p)] 2  for  p  =  .  1,  .  2,  •  •  • ,  .9. 

The  cells  and  the  number  of  PS-data  points  falling  in  them 
are  as  follows: 


i 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

1 

0, .325 

.  325, . 472 

.472,.  597 

.957,1.097 

1.097,1.269 

1.  269,1.517 

1.517,  °° 

A 

1 

1 

0 

0 

2 

0 

0 

1 

1 

15 

Since  four  cells  are  empty,  E1Q  -  6  and  the  ■S’lo  detector 
decides  YES. 

In  closing  this  section  one  should  note 

1.  that  the  exact  distribution  of  E  and  the  limiting  distribution 
of  Sherman’s  Sn  are  given  by: 

If  F  =  FQ,  then 
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(a) 


P(ES+ iS  r)  = 


I 

7  =1 


and 

(b)  Sn  is  asymptotically  N(^n,  <Jn 2) 

where  \in  =  [n/(n+l)]n+1  which  tends  to  l/e  and 

an2  =  \_2nn+2+  n(n-l)”+2]  [(n+2)  (n+l)n+2  J"1-  [ ri/(n+l)]2n+ 2 


which  is  approximated  by  (2e-5)/ne2; 


2.  that  in  view  of  the  discreteness  of  the  statistic,  only  a  few 
significance  levels  can  be  obtained  exactly; 

3.  that  there  is  a  need  for  a  more  extensive  tabulation  of  the 
statistic  E\  and 

4.  that  there  is  a  need  for  a  "rule-of-thumb"  for  choosing  the 
number  s  (of  division  points  or  (s+1)  of  cells)  for  a  given 
PS  data  sample  size  n  in  order  to  attain  some  preassigned 
goodness  criteria. 

In  order  to  circumvent  the  difficulties  above,  one  can  make 
use  of  the  Model  I  rank-sum  detectors  of  the  next  section. 


C.  MODEL  I  RANK-SUM  DETECTORS 


The  detectors  of  this  section  are  based  on  statistics  of  the 

form 


n 

i=l 


(24) 
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where  H  is  a  strictly  increasing  continuous  cpf,  and  H~l is  its  in¬ 
verse.  At  first  glance  it  is  seen  that  K(H)  is  the  mean  of  the  trans¬ 
formed  observations  (Jj)],  —  ,  H~l\F0  (Xn)].  In  fact,  if  both 

H  and  Fa  are  normal,  exponential,  or  Rayleigh  cpf’s,  then  K(H)  is 
of  the  form  a  X+  ft,  i.e. ,  is  a  linear  function  of  the  sample  mean. 
(See  table  5. ) 


Table  5. 


H 

H~\u) 

F0 

H-1  [F0  Of)] 

K(H) 

N(nv  CTj2) 

°1$  V)  +  Pi 

where  4>  is  the 
^(0,1)  cpf . 

N(h2,  o-22) 

(§)  * 

(aJ.\x 

\2/ 

+  (*  -  T) 

l-e’^i 

-Ax  In  (1-u) 

2 

<W*2)  xt 

(Ai/A2)  J 

(^0) 

(x—  0) 

l_e-.r2Ai 

[-A:  ln(l-u)]* 

l-e”2/ 

J  M  2  Xi 

\J  Ai/A2  X 

(x  —  0) 

(^«) 

These  statistics  are,  then,  generalizations  of  the  sample 
mean  X;  and  one  might  expect  such  statistics  to  have  properties 
similar  to  the  properties  of  X  in  the  parametric  cases  even  when 
there  is  no  simple  relation  between  H  and  x0. 
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* 


F(x) 


Consider  the  situation  in 
which  the  pure-noise  cpf  F0  is 
above  the  noise-plus-signal 
cpf  Fy ,  or  equivalently,  Fy  is 
to  the  right  of  F0.  In  this  case, 
if  there  is  noise-plus-signal 
the  PS  data  values  Xy,  •  •  • ,  Xn 
will  "on  the  average"  be  larger 
than  if  there  is  only  pure  noise 
and  one  says  that  the  noise-plus- 
signal  values  are  stochastically 
larger.  Consequently,  if  there 
is  noise-plus-signal  the 
are  stochastically  larger  as  are 
the  {h~1  [F0  (J,-)]}  ,  and  K(H)  = 

—  'S'f~1[F0  (J^)]  .  Hence,  in  the 

case  outlined  above,  the  K(H) 
detector  would  decide  YES  iff 
K(H)  exceeded  a  level  deter¬ 
mined  by  n  and  a.  Of  course, 
the  reverse  decision  rule  would 
be  employed  by  the  detector  in 
guarding  against  noise-plus- 
signal  which  is  stochastically 
smaller. 


In  practice,  the  detectors  are  usually  based  on  one  of  the 
following  four  statistics: 

n 

(a)  tt  =  -2  ^  In  [F0(^>]  ;  (25) 

i= 1 

n 

(b)  tt'  =  -2  In  [1  -  F^Xj]  ;  (26) 

i=l 
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(27) 


n 

(°)  ^=~  S  Wi);and 

i=i 
n 

(d)  *'Vo(^)].  (28) 

i-  1 

Coincidentally,  these  four  statistics  result  from  choosing, 
as  H,  those  distributions  which  have  maximum  Shannon  information 
over  (a')  the  right  half-line;  (b  ')  the  left  half-line,  (c1)  the  unit  in¬ 
terval  and  (d')  the  whole  real  line,  respectively.  Namely,  (a")  the 
exponential  H(x)  =  1  -  e~x (x  —  0);  (b")  the  negative  exponential, 

H(x)  =  er,  x  <  0;  (c")  the  standard  uniform  #(;r)  =  x,  0  ^  x  ^  1  and 

x 

(d")  the  standard  normal  cpf  H(x)  =  f  — —  exp(-t2/2)  dt. 

n/"2tt 

Thus,  there  is  some  intuitive  justification  for  the  choice  of  these 
four  statistics,  although  they  seem  to  be  directed  toward  detection 
situations  in  which  F0  and  Fx  do  not  cross,  i.e.  ,  in  which  the 
pure-noise  distribution  is  stochastically  larger  or  stochastically 
smaller  than  the  noise-plus -signal  distribution. 


Example  6- 

Pure-noise  cpf:  F0(x )  =  1  -  exp(-.  0625  x2)  ( x  ^  0) 

PFA  a  -  .005 

PS  data:  3.2,  5.2,  1.2,  7.6,  .8  (n  =  5) 

(a)  tt  detector:  Decide  YES  iff 
5 

-2  ^  In  [^(X^)]  >  25.2  (the  99. 5th  percentile  of  the 
i-1  chi-square  distribution  with 

2 n  =  10  degrees  of  freedom). 
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(b)  tt*  detector:  Decide  YES  iff 


Wi)]  < 


(c)  U  detector:  Decide  YES 

i=l 

(where  .  751  is  the  99.  5th  percentile  of  the  distribution  of 
the  mean  of  a  sample  of  size  5  from  a  standard  uniform 
distribution). 

5 

(d)  Z  detector:  Decide  YES  iff  (.  2)  ^  4>  1[ir0  (X^)]  >1.117 

1 

(where  1.117  is  the  99.  5th  percentile  of  the  normal  distribu¬ 
tion  with  p  =  0  and  cr  2  =  .  2) . 


2. 16  (the  0. 5th  percentile  of  the 
chi-square  distribution 
with  2n  =  10  degrees  of 
freedom). 

5 

iff  (.2)  ^  F0{Xi)  >  .751 


Computations 


i 

X{i) 

F0  {XU))  = 

1-exp  [-.0625  J2(i)] 

In  F0(X (i)) 

S^l/o  (J(i))] 

In  [l-/0U(i))] 
=  -.  0625  X2(i) 

1 

.8 

.040 

-3.220 

-1. 750 

-  .040 

2 

1.2 

.086 

-2.450 

-1.365 

-  .090 

3 

3.2 

.457 

-  .782 

-  .108 

-  .640 

’  4 

5.2 

.  816 

-  .203 

+  .900 

-1.690 

5 

7.6 

.973 

-  .027 

+1. 927 

-3.610 

2.372 

-6.682 

-  .396 

-6. 070 
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^  F0(Xt)  =2.372 
In  F0(Xt)  =-6.682 
(Xi)]  =--396 

^  In  [1  -  X0(X;)]  =^[-.0625  X2(i)]  =  -6.070 

(a)  v  detector 
5 

-2  5!  ln  [X0(X;)]  =  13.364  *  25.2  Decision:  NO 

i=  1 


(b)  it'  detector 
5 


-2  I 


In  [1  -  F0{Xi)]  =12.140  {2.16  Decision:  NO 


i= 1 


(c)  U  detector 


(.2)  J  MX;)  =  .4 744  ^  .750  Decision:  NO 

i=l 


(d)  detector 


(-2)  X 


(.2)  £  *  ^(X;)]  =-.0792  *  1.117  Decision:  NO 

i=  1 

Very  recent  results12  indicate  that  the  statistics  of  the  form 

n 

/*"  (X;)]  are  really  special  cases  of  a  more  general  class 

i= 1 

of  statistics — the  "likelihood-ratio"  distribution-free  statistics. 
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They  are  of  the  form 


n 

Th  =  2  In  h'tfodd]  (29) 

i=l 

where  h  is  a  differentiable  strictly  monotone  cpf  on  the  unit  inter¬ 
val,  i.  e. ,  h  is  monotone  increasing  and  h( 0)  =  0,  h(l)  =  1,  and  of 
the  form 

r  =2j[jr0(j.)]  (30) 


3  i  .  . 

where  J'(u)  =  - — In  hj(u)  ,  and  \ha\  is  a  family  of  differ- 

3  CL  1  a=  a  0  '  * 

entiable,  strictly  increasing  cpf's  with  haQ(u )  =  u. 


Detectors  based  on  this  more  general  statistic  reduce  to  the 
it',  it,  U,  and  Z  detectors  when  h(u)  =  l-(l-u)2;  ua ;  aebu+  c  and 
^[<^~1(ii)  -  a],  respectively.  Of  course,  the  number  of  detectors 
of  this  more  general  type  is  unlimited.  However,  the  most  common 
version  other  than  the  tr,  tt',  U,  and  Z  detectors,  which  intuitively 
seem  good  for  the  stochastically  larger  and  stochastically  smaller 
cases,  is 


Z2  =  ^  (Zj)]  |  ,  which  results  for  h(u)  =  $ 

i=l 


$~V) 


(31) 


F(x) 


By  intuition  and/or  an 

investigation12  of  the  method  of 

deriving  this  formula,  one  con- 
_ o 

eludes  that  a  Z  detector  has 
some  optimal  properties  for 
situations  in  which  the  noise- 
plus-signal  cpf  F\  and  the  pure- 
noise  cpf  F0  have  different  dis¬ 
persions  or  spreads.  Exact 
methods  of  choosing  h  will  be 
discussed  in  the  section  on 
goodness  criteria.  This  section 
will  be  terminated  with  an 
example  of  the  application  of 
the  Z2  detector. 


Example  7. 


D.  GOODNESS  CRITERIA  FOR  MODEL  I  DETECTORS 


All  statistical  goodness  criteria  for  detectors  are  based 
directly  or  indirectly  on  the  PFAa  and  PFD  p.  Usually  one  fixes 
PFA  ci  and  is  consequently  led  to  criteria  which  (for  fixed  a)  de¬ 
pend  solely  on  the  PFD  p. 

Assuming  that  both  the  pure-noise  cpf  F0  and  the  noise- 
plus -signal  cpf  Fi  are  normal,  one  finds  that  for  SDF  detectors  p 
depends  only  on  cr0 /<j1  and  -/HqJ/ctj  ,  i.  e. ,  the  ratio  of  the  stand¬ 
ard  deviations  and  the  ratio  of  the  difference  of  means  to  u1 . 

Hence,  P  can  be  completely  tabulated  subject  only  to  limitations  of 
computation  time  and  costs. 

The  detectors  being  considered  here  are  all  SDF  and,  hence, 
P  depends  only  on  FqF^1  =  (j^y1)-1.  However,  even  with  this  sig¬ 
nificant  reduction  of  the  possible  alternatives,  it  is  impossible  to 
tabulate  or  give  formulae  for  all  situations  of  interest.  Conse¬ 
quently,  a  statement  of  the  form  "The  T1  detector  is  better  than  the 
T2  detector"  cannot  be  made  without  some  rather  severe  qualifica¬ 
tions. 


In  order  to  make  some  recommendations  as  to  the  relative 
merits  of  Model  I  detectors,  one  must  restrict  consideration  to 
certain  special  classes  of  alternatives,  as  follows. 


1.  One-sided  Bands  and  max-min  PFD  ft 


F{x) 


Based  on  the  statistical 
work  of  Chapman9  one  considers 
for  each  strictly  increasing 
continuous  pure  noise  cpf  F0 
the  class  9  ( F0  ,  A)  of  all  noise- 
plus-signal  cpf'sFi  such  that 

(a)  F0  >  F1  and 

(b)  max  [F0(x)  -  Ji(j:)]=  A, 

x 

where  0  <  A  <  1. 


For  the  class  Q  (F0,  A),  one  says  a  Tx  detector  is  max-min 
PFD  better  than  a  T2  detector  if  (3  A)  <  (3  (T2,  A)  and  §_  {Tlt  A) 

<  §_(T2,  A)  where  /3  and  /3  are,  respectively,  the  supremum  and 
infimum  of  the  indicated  detectors  for  the  class  9  ( F0 ,  A). 

The  following  curves  contain  slots  of  j3  vs.  A  and]?  vs.  A 
for  the  Model  I  detectors  based  on  the  statistics  tt,  tt',  D~n  ,  nu2n, 

U ,  Q\ 2  ,  9*5  »  and  9'g  . 

The  maximum  PFD's,  the  /3 T s,  for  the  statistics  Q'p  are  all 
(3  (Q'p,  A)  =  1  -  a  and  are  not  plotted.  Also,  on  these  graphs, 
which  are  plotted  for  a  =  .  01  and  .  05,  and  n  =  50  and  100,  there  is 
a  single  graph  labelled  "X, "  which  is  a  plot  of  PFD  [3  vs.  A  for  the 
parametric  case  in  which  both  the  pure-noise  cpf  F0  and  the  noise- 
plus-signal  cpf  F1  are  normal  cpf’s  with  equal  variances  and  for 
which  (a)  and  (b)  above  arc  satisfied.  All  the  other  graphs  are 
completely  DF,  in  the  sense  that  they  are  valid  for  any  strictly  in¬ 
creasing  continuous  cpf  F0  and  its  alternative  class  Q(F0,  A). 

From  these  graphs  one  concludes  that 

1.  there  is  a  general  tendency  for  detectors  with  high  maximum 
PFD's  to  have  low  minimum  PFD's,  and  vice  versa;  hence, 

2.  no  one  of  the  Model  I  detectors  based  on  tt,  tt',  D~n  ,  nw2n, 

U,  Q\2  ,  9!5  ,  9.'a  is  max-min  better  than  any  other  one 
for  all  of  the  cases  considered. 

This  result  is  somewhat  discouraging  to  one  who  wishes  to 
order  the  detectors  as  to  goodness.  It  is  further  discouraging  to 
note  that  as  of  this  writing  there  are  not  available  max-min  PFD 
data  for  the  other  Model  I  detectors  being  studied,  Dn,  W2n,  Q  , 

D~n  ,  Qs,  Z,  Z  ,  etc.  One  therefore  seeks  other  criteria. 


Maximum  and  minimum  PFD  for  one-sided  noise. 


(Continued) 


51 


MM 


2.  PFD  for  Rayleigh  Alternatives 


(x)  =  1  -  exp  (-.r2) 


- F-l  (x)  =  1  -  exp  ( -x2/6 ) 

/ 

- ►  x 


Some  engineers  have 
found  that  Rayleigh-type  dis¬ 
tributions  occur  frequently  in 
practice.  Consequently,  it  is 
not  unreasonable  to  consider 
the  performance  of  Model  I  de¬ 
tectors  in  the  situation  for 
which  F0  (x)  -  1  -  exp  (-x2)  and 
F1(x)  *  1  -  exp  (~x2/e  ),  as 
illustrated.  (A  more  accurate 
plot  of  F0  is  given  on  page  30. ) 


Since  the  mathematical  expressions  for  exact  PFD  are  al¬ 
most  completely  intractable,  one  has  to  resort  to  Monte  Carlo 
methods  (10,  000  trials)  to  obtain  accurate  estimates  of  the  PFD's 
except  for  the  Qx  detectors  in  the  plots  of  /3  vs.  n  for  fixed  values  6  , 
where  only  the  values  for  which  6  ~  2  and  4,  and  n  =  5,  10,  15,  and 
20  are  computed.  The  curve  labeled  5'min/3  represents  the  value  of 
/3  for  the  optimal  choice  of  the  quantile.  Further,  one  might  in¬ 
tuitively  divide  the  graphs  into  two  sets — one  for  those  detectors 
which  are  based  on  the  "one-sided"  statistics  D~n,  it,  tt'  and  U  and 
the  other  for  those  detectors  based  on  the  "two-sided"  statistics 
Dn,  n w2n,  Sn  ,  W2n,  and  Fs+V  This  latter  division  of  the  class  of 
detectors  was  made  because,  prior  to  making  the  computations,  it 
seemed  intuitively  clear  that  the  "one-sided"  detectors  should  have, 
in  general,  better  performances  against  the  one-sided  alternatives 
1  -  exp  (-;r2/2)  and  1  -  exp  (-x2/4).  This  suspicion  is,  however, 
not  borne  out  by  the  plots  of  the  Monte  Carlo  computations. 
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From  these  plots  one  can  make  the  following  general  state¬ 
ments  in  which  "  j"  indicates  "generally  better  than"  for  the  cases 
considered. 

(a)  For  the  "two-sided"  detectors 


(b)  For  the  "one-sided"  detectors 
tt'  i  U  j  D~n  |  tt  j  C|5 

(c)  There  is  a  significant  overlap  of  the  ranges  of  PFD  for  the 
two  sets  of  detectors. 

(d)  Of  the  detectors  studied  for  the  cases  considered,  tt'  is 
clearly  the  best  and  Es+J  is  clearly  the  worst. 

These  results,  of  course,  depend  very  heavily  on  the  class 
of  alternatives  considered,  and  the  strongest  statement  one  can 
make  on  the  basis  of  these  results  is  the  following. 

(e)  If  one  wishes  to  use  a  Model  I  detector  which  (besides  its 
DF  property)  performs  well  against  Rayleigh  alternatives, 
one  should  choose  a  tt'  detector  and  avoid  using  an  Es+i 
detector. 

In  order  to  make  comparable  statements  concerning  other 
classes  of  pure  noise  and  noise-plus-signal  cpf's  one  would  have 
to  make  comparable  computations.  Of  course,  the  amount  of  such 
computation  possible  is  only  limited  by  time  and  money  considera¬ 
tions. 


3.  Asymptotic  Relative  Efficiency  (ARE) 


The  next  comparison  of  Model  I  detectors  will  be  made  on 
the  basis  of  ARE.  One  recalls  that,  roughly,  the  ARE,  A(Tlt  Tz), 
of  a  Tj  detector  with  respect  to  a  T2  detector  is  the  limit, 


lim 


n(T2) 

n{T j)’ 


of  the  ratio  of  sample  sizes  for  fixed  o:  and  P  in  the 


presence  of  increasingly  weak  signals.  Of  course,  as  the  signal 
becomes  increasingly  weak,  the  noise-plus-signal  cpf  F±  tends  to 
F0  ;  and,  as  one  might  suspect,  the  limit,  A(T,t  T2),  is  sensitive 
to  the  form  of  the  Fa  and  the  manner  in  which  the  F^ s  tend  to  F0  . 

Another  consideration  of 
great  practical  importance  is 
the  fact  that  in  a  large  number 
of  cases  of  interest  the  deriva¬ 
tion  of  a  computable  expression 
for  ARE  is  almost  impossible. 
This  situation  is  reflected  in 
table  6,  which  treats  only  de¬ 
tectors  based  on  Z,  U,  tt,  and 
-it',  translation  alternatives  (as 
illustrated  here),  and  certain 
classes  of  pure-noise  cpf's  F0- 
Several  numerical  entries  are 
missing  because  they  have  not 
yet  been  computed. 

Table  6.  ARE's  for  Model  I  Detectors 


F  Or) 


Fg  (x)  =  F o  (x-0) 


Pure -noise 
cpr  f0(x) 

N  oi  se-plus  -signal 

F0  where  0  \  0  j  U  ) 

A(Z,n)  | 

A(Z,  tt') 

(7T ,  U  ) 

A  (tt'.  U) 

/l(ir,  tt') 

H?) 

(Normal) 

*  ( X-9 ) 

1.05 

Uo 

(Uniform) 

Fo(z)  =r~9 

0^i<  1+0 

oo 

OO 

oo 

1.00 

Ho 

(Exponential) 

F0  (x)  =  1-e"1** 

9  £  x 

OO 

oo 

.33 

er 

150 

(Negative  exponential) 

ex~s 

.33 

(l  +  e‘r)  ’ 

(Logistie) 

( 1  +  e~x*6  )  1 

.955 

1.27 

1.27 

.75 

.75 

1.00 

e  /2,is  9 
l-e"'72,  x  >9 
(Double  exponential) 

e~°/2,  iSJ 
l-e'~+f>/2,  x>9 

.85 

-  +  -  tan'1  x 

2  * 

(Cauehy) 

-  +  —  tan-1  (x-  0) 

2  7T  ' 

.71 

From  table  6  it  may  be  concluded: 


(a)  that  there  is  no  clear-cut  ordering  of  the  detectors  on  the 
basis  of  known  ARE’s;  but 

(b)  that  if  one  wishes  good  asymptotic  performance  (in  the 
presence  of  increasingly  weak  signals)  for  logistic  type 
distributions  one  should  use  a  U  detector  and  avoid  tt  and 
-it'  detectors. 

The  final  goodness  criteria  to  be  discussed  are  adaptations 
of  very  recent  statistical  results  of  Bell  and  Doksum12. 


4.  Uniformly  Minimum  PFD  and  Almost  Locally  Minimum 
PFD  Detectors 


In  keeping  with  the  ideas  developed  in  the  preceding  section, 
two  other  optimal  properties,  which  depend  on  the  particular  class 
of  alternatives,  will  be  discussed  and  illustrated  here.  UM  PFD 
and  ALM  PFD  detectors  will  be  considered  here  to  be  as  defined 
in  the  Introduction. 


One  recalls  that  for  SDF  Model  I  detectors,  /3  is  a  function 
solely  of  h0  =  Fg  Fq1  ,  where  F0  is  the  pure-noise  cpf  and  F0  is  the 
noise-plus-signal  cpf.  By  restricting  attention  to  classes  J  Fg  j 
which  satisfy 


(!)  hg( n)  =  U  for  all  u\  and 

0  60 


(2)  gg  In  hp’  {u) 


=  J0(u)  exists; 


o=ec 


one  can  make  use  of  the  following  theorem  of  Bell  and  Doksum: 


12 


Theorem  6.  An  SDF  detector  whose  YES-NO  decisions 
are  based  on  the  statistic 


i~ 1 


Jo  [WO] 
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(i)  is  a  UM  PFD  detector  for  each  class  of  noise-plus-signal 
cpf's  with  he’(u)=exp  [a(0)  J0(u)  +  £>(0)]  where  a(0)=*O 
for  0  *  90  and  a( 0O )  =  b(0o)  =  0;  and 

(ii)  is  an  ALM  PFD  detector  for  each  class  of  noise-plus-signal 
cpf's  with  he'{u)  =  exp  [a(0)  J0(u )  +  b(6)  +  Q(u,  0)]  where 
a(6  )  and  b(6  )  are  as  in  (i),  and  lim  Q(u,  9  )/(6  -  90)  =  0  for 
all  u, where  the  limit  is  taken  as  9  tends  to  90  . 

Employing  this  theorem  and  the  fact  that  a  detector  based 
on  a  statistic  T '  is  equivalent  to  that  based  on  the  statistic  c(9  )T*  + 
d  (0  ),  where  c(0  )  *  0,  one  can  construct  table  7,  which  illustrates 
and  summarizes  known  results  in  this  direction.  The  complexity 
of  the  formulas  and  the  vacant  spaces  in  the  table  indicate  the  depth 
of  the  problem  here.  (Note  that  each  detector  is  an  SDF  detector, 
which  also  has  certain  optimal  properties  for  specified  classes  of 
noise-plus-signal  cpf's.) 


Table  8.  Statistical  Distributions  for  Model  I  Detectors 


Statistic 

Distribution 

Source  of  Tables,  Formulas, 
etc.  (Numbers  refer  to 
Bibliography.) 

0n 

(1-sample  Kolmogorov) 

21,  p.  423  ff 

0  n  >  0  n 

(1 -sample  one-sided 
Kolmogorov) 

22 

D'n 

23 

nJ'n 

(1 -sample  Cramer -von 
Mises) 

20;  21,  p.  443  ff 

(Equivalent  to  U ) 

V2n 

(Anderson-Darling) 

24 

Q'p 

Binomial  (sign) 

21,  p.  362  ff 

-N.y 

Qs 

Asymptotically  chi- 
square 

21,  p.  49  ff;  8,  p.  421  ff 

A  A 

Q,  Qs 

Asymptotically  chi- 
square  (Matusita) 

25;  21,  p.  49  ff 

Sn 

Asymptotically  normal 
(Sherman) 

26;  21,  p.  477  ff 

S'n 

(Kimball-Moran) 

Esn 

(Empty  cell) 

21,  p.  454  ff 

IT,  IT1 

Chi-square 

9;  21,  p.  49  ff 

U 

Sample  mean  of  rectan¬ 
gular;  asymptotically 

\  2  12  / 

27,  p.  257 

11 

Z 

Ar(0,  1) 

21,  p.  1  ff 

-2 

z 

Chi-square 

12;  21,  p.  49  ff 
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PS  DATA 


SIGNAL  / 

TRANSMISSION 

MEDIUM  ! 

DETECTOR 


| - O  YES 

1 — O  NO 


NOISE 


i  KNOWN 
iPURE -NOISE 
I  DATA 


COMPARE  X\,  •  •  •  ,Xn  AND  7X  ,  •  •  • ,  7m 


III.  MODEL  II  DETECTORS 


The  Model  n  detector  is,  as  would  be  expected,  an  exten¬ 
sion  of  the  Model  I  detector.  In  Model  II  PS  data  Xls  •  •  • ,  Xn  are 
available,  as  well  as  known  pure-noise  data  71}  •  •  • ,  Ym  .  The 
decision-making  process  then  consists  of  comparing  Xlt  •  •  • ,  Xn 
with  7,,  •  •  • ,  Yr..  and  deciding  YES  iff  they  are  "too  far"  apart  in 
some  sense.  Further,  one  recalls  that  by  imposing  the  reasonable 
SDF  and  Scheffe  conditions  of  theorem  2,  page  10,  one  is  led  to  the 
class  of  rank  detectors  which  base  their  YES-NO  decisions  solely 
on  the  ranks  R(X1),-  •  • ,  R(Xn),  R(YX),  •  •  • ,  R(Ym)  of  the  X's  and  7’s 
in  the  combined  sample  Xlt  •  •  • ,  Xn  ,  Y1 ,  •  •  • ,  Ym . 

As  in  the  case  of  Model  I  detectors  one  considers  three 
"natural"  subclasses  of  Model  n  detectors: 

1.  Sample  cpf  (SDF  Scheffe  Model  II)  detectors,  which  employ 
statistics  based  on  differences  of  the  PS-data  cpf  Fn  and 
the  pure-noise-data  cpf  Gm. 

2.  Run-block  (SDF  Scheffe  Model  II)  detectors,  which  base 
their  decisions  on  the  number  of  PS-data  values  X1 ,  •  •  • ,  Xn 
which  fall  in  certain  intervals  determined  by  certain  (pre¬ 
assigned)  order  statistics  of  the  pure-noise  sample 

7,  . . .  j 

3.  Rank-sum  (SDF  Scheffe  Model  II)  detectors,  whose  decisions 
are  based  on  sums  of  functions  of  the  ranks  R(X ]),'•’,  R  ( Xn), 

R(Yi), •  •  ■ ,  R(Y»)- 
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In  the  succeeding  three  sections  general  formulas  and 
specific  examples  of  the  common  types  of  Model  II  detectors  are 
developed.  The  fourth  of  the  succeeding  sections  introduces  a  de¬ 
tector  based  on  a  randomized  statistic,  which  is  relatively  new. 
This  new  class  of  detector  introduces  artificial  noise  into  the  de¬ 
tector  in  order  to  circumvent  some  difficulties  arising  in  the  use 
of  the  more  classical  DF  detectors. 

In  the  latter  sections  of  this  chapter,  the  various  Model  n 
detectors  are  compared  in  terms  of  such  previously  developed 
goodness  criteria  as  PFA  a,  PFD  j3,  and  max-min  /3. 


In  describing  the  Model  II  detectors  it  is  necessary  to  recall 
some  notation  and  to  introduce  other  notation. 

1.  PS  sample:  Xj ,  Order  statistics:  X(l)<  •  •  •  <X(n) 

2.  Pure-noise  sample:  Y\,  •  •  • ,  Y  Order  statistics:  Y (1)<*  •  •  <  Y(m) 

3.  Combined  sample:  {Zu  ■  •  ■  ,  ZN\  =  {jj,  *  *  * ,  Yn ;  Ylf  •  *  • ,  Ym\ 

where  N  -  n  +m 

4.  Combined  sample  order  statistics:  Z(l)<-  •  •  <Z(N) 

5.  Degenerate  cpf:  e  where  e(;r)  =  0  or  1  according  as 

x<  0  or  x  s  0 


G.  PS  sample  cpf:  Fn  where  Fr 


nix)  =  \  2 


e(x-Xi) 


i=l 


m 


7.  Pure-noise  sample  cpf:  where  G  (y)  =  -  )  €(y-Y,) 


J- 1 


N 


8.  Combined  sample  cpf:  F  where  F(s)  =  £(s-Zr) 

r=  1 


9.  Rank  of  X-  in  combined  sample:  R  (X  ) 
Rank  of  Y  in  combined  sample:  R(Y  ) 


are: 


Some  of  the  more  useful  relations  between  these  quantities 


1.  R&t)  =  2  e(Xi-Zr)=  mXi) 

r  =1 


2.  F(s) 


nFn(s)  +  mGm(z) 
n  +  m 


n  m  N 

3.  X  ^Ui)+X  r  N(N+ 1) 

i  =1  J=1  r  =1 


4.  The  possible  ranks  are  {l,  2,  •  •  • ,  N\  and,  consequently, 
the  collection  —  ,  R  (Xn)j  of  X-ranks  uniquely  de¬ 

termines  the  collection  ji?  (Xj),  •  •  • ,  S(Xn)}  of  7-ranks  and 
vice  versa. 


A.  MODEL  II  SAMPLE  Cpf  DETECTORS 


The  detectors  of  this  type  are  essentially  extensions  of  the 
Model  I  sample  cpf  detectors. 


1.  Kolmogorov-Smirnov  Detector 

r ;  *)  =  sup  j  | Fn  (x)  -  Gm(x)  |  [X(x)]  j 

x  '  ' 


The  usual  variations  of  this  statistic  are 


D(n,m)  =  sup  | Fn(x)  -  <5m(a:)| 


(33) 


x 


=  max  |ift  (Z(r))  -  Gm(Z  (r))|  =  m  1  max  | NFn  ( Z{r)~r 


r 

=  max 


max 

i 


<M*(i))|  »max  I  (Yti))  ~- 

J  * 


D+(n,  m  )  =  sup  [Fn(x)  -  5m(r)]  =  max  [_Fn  {. Z(r ))  -  Gm(Z(r))]  (34) 


=  max  !  max 

i 


•i-  ft 


(T(i))l  ,  max  fjn(7(j))  -  l 

j  1  m 


D'(n  ,  m  )  =  sup  [ft,(;r)  -  4(r)]  =  max  [ft,  (Z(r))  -  Fn  (Z(r))]  (35) 

x  r 


=  max  max 


Gm(X(i))  -  I 
n 


,  max 

J 


’i  -  Fn(Hj-)) 

m 


2.  Cramer-von  Mises  Detector 


Corresponding  to  the  two-sample  Kolmogorov-Smirnov  de¬ 
tector  is  the  two-sample  Cramer-von  Mises  detector  which  bases 
its  decisions  on  statistics  of  the  form  below. 


W  (n  >  m ;  r  ,  ) 


/  j| Fn(x)~  Gm(x)\  J*lF(x)]  j  d  Fix) 


(36) 
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For  an  illustration  of  the  computations  involved  with 

/ 

Kolmogorov-Smirnov  and  Cramer-von  Mises  detectors,  one  can 
consider  the  following  example. 
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Example  8. 


PFAq:  *  .  005  n  =  5,  m  *  6 

D(n,m  )  detector:  Decide  YES  iff 

max  |  Fn(Z(r))  -  Gm(Z(r))\  >  .833* 
r 

w\n,  m  )  detector:  Decide  YES  iff 
N-l 

i  2  [f„(2(r»  >  •  766* 

N  r*l 

PS  data  (J^):  3.2,  -1.4,  7.0,  4.5,  0.7 

Pure-noise  data  (Tj  ):  -1.6,  5.4,  -0.4,  -0.3,  2.5,  1.0 

Sample  Fn(Z(r))  G„  (Z(r))  Fn  (Z(r))  -  Gm  (Z(r))  [Fn  (Z(r))  ~  G:,  (Z(r))] 


0 

.167 

-.167 

.200 

.167 

.033 

.200 

.333 

-.133 

5  0.7 

6  1.0 


2.5 


8  3.2 


9  4.5 


10 


11  7.0 


.833 


.833  | 

.833 


1.000 


1.000  | 


D  (n,  77i )  detector:  D(n,  m)  =  .433  <  .  833  Decision:  NO 

W2(n ,  in )  detector:  W2(n,  m)  -  7-  (.500)  =  .045  <  .  766  Decision:  NO 


Obtained  from  ref.  21,  p.  443-444.  Since  the  exoct  value  PFA  Of-  .005  was  not  tabulated, 
.766*'  is  an  interpolated  value,  and  “.833"  yields  O'-.-  .004. 


From  the  table  above  it  is  easy  to  compute  the  values  of 
some  of  the  other  statistics  of  importance,  such  as 

(n,  m)  =  .033;  D  (n,  m )-.  433  and 

2  R(X)  =  2  +  5+  8+  9  +  11  =35. 


3.  Sign-Quantile  Detector 

The  final  Model  II  sample  cpf  detector  of  importance  is  the 
sign-quantile  detector.  The  rationale  for  its  use  is  the  immediate 
extension  of  that  for  the  Model  I  sign-quantile  detector,  which  is, 
roughly:  if  the  unknown  continuous  pure-noise  cpf  F0  and  the  un¬ 
known  continuous  PS-data  cpf  Fi  are  equal,  then  one  would  expect 
the  PS  sample  Xlt  •  •  • ,  Xn  and  the  known  pure-noise  sample 
Y\,  •  •  • ,  Ym  to  be  "well  interspersed.  "  More  specifically,  it  would 
be  expected  that  approximately  np  of  the  X's  and  mp  of  the  7’s  are 
less  than  or  equal  to  the  100  pthpercentile  Z(Np)  of  the  combined 
sample  Zlf  •  •  • ,  ZN  . 

Consequently,  for  0  =  qQ  <  qy  <  q2  <  •  •  •  <  qs  <  qs+1  =  1 
\Fn  ( Z(Nqi ))  -  Fn(Z{Nqul))  -  (q.  -  g..,)] 

and 

\Gm(Z{Nq.))  -GMNq^j)  ~  (g^^)] 

are  both  measures  of  deviation  from  the  "well-interspersed"  situa¬ 
tion.  Customarily,  one  uses  the  notation  =  qt  -  qi_1  ; 

nu  =  n\Fn(Z(Nqi))  -  Fn  (ZiNq^))  J 


n2i  =m\G7n(Z(Nqi))  -  Gm{Z(Nq._^)\  . 


and 


Hence 


~nu 

- Pi 

and 

n2i 

—  "  P, 

n  1 

m  1 

are  the  above-mentioned  measures  of  dispersion. 

For  visualization  it  is  perhaps  helpful  to  consider  table  9. 


Table  9 


Nss^£> 

Interval  NX? 

PS  Data 
Frequency 

Pure-Noise 
Data  Frequency 

TOTALS 

(-oo  ,  Z(Nq1)'] 

nn 

n21 

Np1  =  Nq1 

( Z(Nqx ),  Z(Nq2)-] 

n12 

n22 

Np2=N(q2-qi) 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

• 

(Z(Nqs),  «) 

nl,s+l 

n2,s+l 

NPs+i  =^(1"9S) 

TOTALS 

n1  =  n 

7^2  — 

N 

Z  ( Nq is  the  (lOOq^111  percentile  of  the  combined  sample 
and,  hence,  Nqt  of  the  combined  sample  points  are  less  than  or 
equal  to  Z(Nqi). 


Further,  if  the  unknown  cpf's  of  the  PS  data  and  the  pure- 
noise  data  are  equal,  one  expects  that  n-^is  approximately 
nJpi  =  np.  and  that  n2i  is  approximately  n2p_.  =  mp.  for  all  i. 

One  then  generalizes  the  contingency-table  tests  for  inde¬ 
pendence  to  obtain  detector  statistics  of  the  form  below. 


For  the  sign-quantile  detector, 


Q(nvn2  \  qp'“,  qs;r,  *) 


s  +1 

-I 


i  =1 


2 


I 


^  i  f  > 


(40) 


where  n1  =  n,  n2=m  to  simplify  the  summation,  and  the  p.  are 


such  that  each  Np  is  an  integer. 

On  setting  s  =  1,  q  =^,  r  =  2,  *( nn/rij  ,  Pt)  =  \! ni /Pt 

and  making  the  usual  chi-square  correction  for  a  2  x  2  contingency 
table,  one  obtains  the  median  detector 

«i=  ^  -1)2  (41) 

On  setting  p.  =  (s  +1)_1,  r  =  2  and  4'  =  1,  one  obtains,  on 


simplifying, 


=  (S+i)N  y  2_ 

^s+i  rim  2—i  “i  i 


The  Matusita  modification  yields 


3+1  /  s+1  \ 

2  (  I5T-  =  yspA 

i >i  \\i n  \j  *  /  \  *J™  i=i  / 


As  an  illustration  of  the  computations  one  might  consider 
the  following  example. 

Example  9. 


PFA  O'  =  .  005  n  =  20;  m  =  16 


detector: 


24  detector: 


Decide  YES  iff  ( |2nn  -  20|  -if 

=  .450  (|nn  -  10 1  -  .5)2>  7.88  (the  99. 5th  per¬ 
centile  of  the  chi-square  distribution  with  1 
degree  of  freedom) 

Decide  YES  iff  (4)(3G)  S  nv2  -  (2Q)(36) 

(20)(16)  A,  11  16 


=  .450  ^  n j,:2  -  45  >  10.60  (the  99.5th  per- 
i  =1 

centile  of  the  chi-square  distribution  with  3 
degrees  of  freedom) 
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A 

Q\  detector: 


PS  Data  (Xj): 
n  =  20 

Pure  Noise  (J7): 

771  =  16 


Decide  YES  iff  2  1  - 


v/(20)(16)  i= 1 


=  2  -  .112 


^  \fnlin2i 


>  q(.  995)  [where  q(.995) 


i=l 


is  the  as  yet  untabulated  99.5th  percentile  of  the 
distribution  of  Matusita's25  two-sample  statistic  ] . 


-.44,  -.10,  -.41,  1.08,  -.77,  -1.65,  1.19,  -.55 
-2.05,  1.04,  .05,  1.88,  .31,  -.96,  -.18,  .52 
-1.41,  -1.28,  .10,  -.39 

-1.18,  -1.31,  .52,  -.08,  -.86,  .41,  1.98 
.15,  1.14,  -1.95,  -.45,  1.28,  -1.55,  -67 
1.18,  -.31 
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Z(r) 

Sample 

Designation 

-2.05 

7 

-1.95 

7 

-1.65 

X 

-1.55 

7 

-1.41 

X 

-1.31 

7 

-1.28 

7 

-1.18 

7 

-  .96 

7 

-  .86 

7 

-  .77 

7 

-  .67 

7 

-  .55 

7 

-  .45 

7 

-  .44 

7 

-  .41 

7 

-  .39 

7 

-  .31 

7 

-  .18 

7 

-  .10 

7 

-  .08 

7 

.05 

7 

.10 

7 

.15 

7 

.31 

7 

.41 

7 

.52 

7 

Computations 
for  Qv  Q4,  Q\ 


five  7’s 
four  7's 


five  7' s 
four  7’s 


six  j’s 
three  7's 


four  X  s 
five  7's 


From  observing  the  ordered  values  above  one  can  construct 

A 


the  necessary  tables  for  computing  Qlt  Q4,  and  Q\  (tables  10  and 

11). 


Table  10 

For  computing  Q1 


\<S> 

xx 

Interval 

X 

Y 

TOTALS 

Z(  18)] 

1st  half 

10 

8 

18 

(Z<  18),  «) 

2nd  half 

10 

8 

18 

TOTALS 

20 

16 

36 

Table  11 

For  computing  Q4  and  Q\ 


Interval  nYo 

X 

Y 

TOTALS 

(-«,  Z(9)] 

1st  quarter 

5 

4 

9 

(Z( 9),  Z (18)] 

2nd  quarter 

5 

4 

9 

(Z  (18),  Z  (27)] 

3rd  quarter 

6 

3 

! 

9 

(Z( 27),  «) 

4th  quarter 

4 

5 

9 

TOTALS 

20 

16 

36 

Using  the  given  formulas  one  finds: 


<2j  =  .  113  <  7.  88  and  the  QL  detector  decides  NO; 

Q4  =  .90  <  10.60  and  the  Q4  detector  decides  NO;  and 


/>  ^ 

<2*4  =  .022.  (Sinee  the  has  not  been  tabulated  no  deeision 
can  be  made  here.  ) 

Optimum  ehoiees  of  the  number  (s  +  1)  of  intervals  and  other 
topies  related  to  goodness  criteria  will  be  treated  in  Section  E. 


B.  MODEL  II  RUN-BLOCK  DETECTORS 


This  elass  of  deteetors  is  also  based  on  the  measures  of 
relative  "interspersion.  "  An  Z-run  in  the  ordered  combined  sample 
is  a  conseeutive  sequence  of  X’s  immediately  preceded  and  imme¬ 
diately  succeeded  by  Ps.  A  7-run  is  similarly  defined.  If  the  PS 
data  and  the  pure-noise  data  have  identical  cpf’s,  one  feels  that  the 
two  samples  should  be  well  interspersed  and  that,  therefore,  the 
total  number  of  runs,  i.  e. ,  the  number  of  J-runs  plus  the  number 
of  7-runs,  should  be  relatively  large.  On  the  other  hand,  if  the 
pure-noise  (or  7)  cpf  and  the  PS  (or  Z)  cpf  are  as  illustrated,  one 
expects  that  the  number  of  runs  will  be  relatively  small. 


F0  (PURE  NOISE) 

F  (NOISE  PLUS  SIGNAL) 


Fix) 


alternatives 


Fix) 


alternatives 
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Once  the  eombined  sample  is  ordered  it  is  a  simple  matter 
to  ealeulate  the  total  number  of  runs.  However,  it  is  of  some 
value  to  give  an  explicit  formula  for  calculating  this  total  number 
of  runs.  To  that  end,  one  first  notes  that 


ei+i  =  e  (X(7(i+1))  “  Fn(Z{i))  -l/2 n]  =  1  iff  Z(i+ 1)  is  an  X ; 


0  if  both  Z(i+ 1)  and  Z(i)  are  X's  or  both  are  7’ s 
1  if  Z(i+ 1)  is  an  X  and  Z(i)  is  a  Y 
-1  if  Z(i+ 1)  is  a  Y  and  Z(i)  is  an  X; 


n- 1 

^  |ei+l_ei|  = 

i-1 


1  if  Z(i+ 1)  and  Z(i)  are  from  different  samples 
0  if  Z(i+ 1)  and  Z(i)  are  from  the  same  sample; 

number  of  changes  of  sample  J-to-7  and  7-to-X 
in  the  ordered  eombined  sample; 


and  that  the  total  number  of  runs  is  therefore 


n-1 

^=1+2  |e[Xn  (Z(i+1))  -Fn(Z(i))  -1/2 n] 
i=l 

-  e[Xn  (Z(i))  -Fn  (Z(i-l))  -l/2n}\  (44) 

Closely  related  to  the  run-detector  is  the  detector  based  on 
the  number  of  empty  7-bloeks.  Originally,  a  7 -block  was  defined 
as  the  interval  between  two  successive  7's  in  the  eombined  sample. 
A  large  number  of  empty  7-blocks  would  indicate  that  the  samples 
were  not  well  interspersed. 

The  77i  7-values  Yls  •  •  • ,  Ym  give  rise  to  (771+I)  bloeks: 

(  —  ,  r(l)],  (7(1),  7(2)], . ,  (7 (771-1),  7(7ti)]  ,  (7(777),  »).  The 

number  of  empty  7-bloeks  is  found  to  be 

771+1 

£  =  (m+l)  -  i  e  [ Fn(Y(j ))  -  Fn  (Y(j-l))  -  l/2n] 


where  7(0)  =  -  «  and  Y(m+1)  =  +«>. 


More  generally,  one  may  wish  to  consider  blocks  deter¬ 
mined  by  certain  percentiles  of  the  Y  (or  pure  noise)  sample.  Let 
0  =  q0  <  qx<  •  •  •  <  qs  <  gs+1  =  1  be  probabilities  such  that  each 
mq.  is  an  integer;  then  a  more  general  empty-block  statistic  is 

£(n,m  ,  <?!>•••>  <?s  )  =  (s+ 1) 


s+1 

"  2  e  |/n  (nraq  ))  -  Fn  (Yfmq.J)  -  l/2 n]  (46) 

j= 1 

The  detectors  based  on  statistics  (45)  and  (46)  are,  of 
course,  immediate  extensions  of  the  Model  I  empty -cell  detector. 
The  Model  II  extensions  of  the  Model  I  spacing  detectors  are  based 
on  statistics  of  the  form 


s+l 


3(n 


,m\q1,’",qs’,r,*)=  2  |^(^[(m+1)9J-]) 


-  Fn(Y[(m+l)q._1])  -  (g,  -  q.^)  ^  (47) 

where  0  =  gQ<  gx<  •  •  •  <  gs  <  gs+1  =  1. 

Corresponding  to  the  Model  I  situation  there  are  two  special 


cases , 


771+1 

5(77  ,  777)=^  l^nOT/))  -M7(j-1))  -A 
j=  1 


JR  +1 


(48) 


771+1 

52(”.  TTL)  =  J 

j= 1 


(^0'»  -FnlXU- 1))  f  (49) 


For  the  final  detector  discussed  in  this  section,  one  con¬ 
siders  a  statistic  specifically  designed  for  the  one-sided  situation 
just  illustrated  (A,  in  preceding  figure).  It  is  the  Epstein-Rosenbaum 


statistic  which  equals  the  number  of  J-values  exceeding  the  last 
7 -value,  i.  e. , 


L'  =  n  [1  -Jn(7(m))]  (50) 

Since  the  last  order  statistic  Y(m)  is  relatively  "unstable,  " 
one  quite  often  makes  use  of  the  more  general  statistic 

L(n,  m;  q)  =  n  [l  -  Fn  ( Y(mq ))]  (51) 

the  number  of  J-values  exceeding  the  100gth  percentile  of  the 
7-sample. 

Before  giving  an  example  of  the  use  of  the  statistics  de¬ 
scribed  above  it  seems  worthwhile  to  comment  that  except  for  R, 
the  statistics  described  above  are  asymmetric,  in  the  sense  that 
one  is  working  with  7-blocks  or  7-runs.  Detectors  with  similar 
rationales  could  be  constructed  if  one  considered  X-blocks  and 
J-runs.  Also  there  is  some  merit  in  considering  the  sum  of  the 
statistics  based  on  the  J-blocks  and  the  7-blocks.  This  latter 
technique  is  used  in  constructing/?  which  is  the  sum  of  the  number 
of  J-runs. 
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Example  10.  Consider 
=  16;  and  PFAcv  =  .005. 


r 

Z(r) 

Cum.  No. 

of  Runs 

Cum.  No.  of 
Empty  Y-bloeks 

V 

L  (n.  77i 

1 

X 

1 

0 

2 

Y 

2 

0 

3 

X 

3 

0 

4 

Y 

4 

0 

5 

X 

5 

0 

6 

Y 

6 

0 

7 

X 

7 

0 

8 

Y 

8 

0 

9 

X 

9 

0 

10 

Y 

10 

0 

11 

X 

11 

0 

12 

r 

12 

0 

13 

X 

13 

0 

14 

Y 

14 

0 

15 

X 

15 

0 

16 

X 

15 

0 

17 

X 

15 

0 

18 

r 

16 

0 

19 

X 

17 

0 

20 

X 

17 

0 

21 

Y 

18 

0 

22 

X 

19 

0 

23 

X 

19 

0 

24 

r 

20 

0 

25 

X 

21 

0 

26 

Y 

27 

X 

28 

Y 

24 

0 

29 

X 

25 

0 

30 

X 

25 

0 

31 

Y 

26 

0 

32 

33 

Y 

26 

1 

X 

wm 

ian 

34 

Y 

HK9 

35 

29 

l 

36 

30 

l 

5=30 

P  =1 

I'«  c 

»  L(n*n 
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The  decision,  rules  are: 


R  detector:  Decide  YES  iff  R  ^  10. 

£  detector:  Decide  YES  iff  -  (^  >  +  2.576 

-  <?(£) 

where  p(£)  =  n2/N  =11.11 
ct(£)  -nm/N^2  -  1.48 

L'  detector:  Decide  YES  iff  20  [l  -  Fn  (7(16))]  a  7. 

(Note:  The  value  "7"  corresponds  to  PFA  a  = 
.01.  There  are  no  tabulated  values  for  a  = 

.  005  r.s  of  this  writing. ) 

L(n,  to,  .  75)  detector:  Decide  YES  iff  20  [l  -  Fn  (7(12))] 

—  n  [1  -  p  ( a )]  +1  where  [  q  -  q  (  a)]  • 

j£lg>  +«<LLa>r1=*-1(a)and, 

(  n  m  )  v  ’  ’ 

consequently,  n[l-p(o!)]  +1=15. 

17 

S(n,  to  )  detector:  Decide  YES  iff  S|  Fn  (W))  - 

1  I  J=1 

Fn  (7 (j-1))  -  — |  -  bv  where  b1  is  the  untab¬ 
ulated  99. 5th  percentile  of  the  S(n,m) 
distribution. 


17 

82(11,  m)  detector:  Decide  YES  iff  Sk  (7(J»- 

J= 1 

1  I2 

Fn  (7  (j-1))  -  —  |  —  cp  where  cx  is  the  untab¬ 
ulated  99.  5th  percentile  of  the  S2(n,  m) 
distribution. 

Consequently,  the/?,  £,  L',  and  i(n,  to,  .75)  detectors 
decide  YES.  Further,  since  Zq  and  cx  are  unknown,  the  decisions 
of  the  S(n,m)  and  S2(n,  m)  cannot  be  determined. 

Optimum  choices  of  q  and  other  topics  related  to  goodness 
criteria  will  be  treated  in  Section  IIIE. 
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C.  MODEL  II  RANK-SUM  DETECTORS 


The  detectors  of  this  type  are  generalizations  of  the  detcc- 

n 

tor  based  on  the  statistic  V=  ^  R(X{).  The  intuitive  iustifica- 

i=  1 

tion  for  the  use  of  this  statistic  is  as  follows.  If  F0  >  F1  (see 
illustration,  p.  41),  then  the  X  distribution  is  stochastically  larger 
than  the  T  distribution,  and  one  expects  that  in  the  ranked  com¬ 
bined  sample  the  X  values  will  occupy  the  upper  ranks.  Hence 

n 

V  =—  ^  will  be  significantly  large. 

i=l 

In  order  to  determine  which  values  of  are  "sig¬ 

nificantly  large"  or  "significantly  small, "  one  needs  tables  other 
than  those  previously  used.  Therefore,  one  might  ask  which  con¬ 
stants  can  be  used  in  place  of  the  ranks,  to  permit  use  of  the 
"standard  tables"  and  satisfy  certain  goodness  criteria.  One,  then, 
is  going  to  consider  statistics  of  the  form 

n 

\  ^  h[R(Xi)]  (52) 

i=l 


where  h  is  an  appropriately  chosen  function.  Table  12  presents 
some  usual  choices  of  h. 


Table  12. 


1 

Statistics  - 


n 


7Z 


2  h[R(Xi)] 
1=1 


Corres. 

1-Sample 

Statistic 

Two-Sample 

Statistic 

hj(r)  (Expectations) 

h^ir)  (Percentiles) 

z 

Fisher-Yates- 
Van  der  Waerden 

£[5<r)|  <0] 

u 

Ma  nil -Whitney - 
Wilcoxon 

=jv7T 

Uo"'(lFi)=lfFl 

JT* 

Savage 

N 

2  i/j 
j-W-r+i 
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Where  $,  UQ  and  Hc  are,  respectively,  the  cpf's  of  the 
standard  normal,  standard  uniform,  and  standard  exponential  dis¬ 
tributions  (see  Section  II C);  £(r)  is  the  rth  order  statistic  of  a 
random  sample  jt/j  ,  W2,  —  ,  Wn  j  from  a  population  with  cpf  H; 
and  E[W(r)\  H~\  is  its  expectation. 

Historically,  the  Fisher-Yates  detector  was  studied  first. 
In  an  attempt  to  attain  approximate  or  asymptotic  normality  one 
replaces  each  R(X^)  by  E  j W  [l?(X^)]  |  $>J  to  obtain  the  statistic 

n 

^  E\w  [Rdi)]  |  *).  (53) 

i=  1 

This  statistic  turns  out  to  be  asymptotically  normal,  but 
for  small  or  moderate  values  of  n  and  m,  special  tables  of  signifi¬ 
cance  levels  are  needed.  Further,  one  needs  the  not-too -common 
tables  of  expected  values  E  [V(r)  |  $]  of  normal  order  statistics. 
To  circumvent  this  latter  difficulty  (which  does  not  exist  for  U0 
and  He  since  explicit  formulas  are  available  in  these  cases),  one 

introduces  the  statistic  based  on  percentiles  > 


Since  there  is  no  apparent  reason  for  restricting  considera¬ 
tion  to  statistics  related  to  $,  one  naturally  extends  consideration 
to  statistics  based  on  UQ  and  tfeand,  finally,  to  those  based  on 
arbitrary  strictly  increasing  cpf's.  The  general  rank-sum  detec¬ 
tors  are,  hence,  based  on  statistics  of  the  form 


n 

V'(H)  ^  E  l^CWi)]  I  *  |  and 

i  =1 


V(H)  =i 

71 


I 


H 


R(Xj) 
N  +  1 


(55) 


(56) 
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Before  illustrating  the  necessary  calculations,  one  should 
note  the  following: 


1.  In  practice,  one  usually  employs  principally  those  detectors 
for  which  H-  4,  UQ  or  He.  These  restrictions,  as  in  the 
one-sample  case,  coincide  with  certain  seemingly  unrelated 
information  theory  results. 

2.  The  statistics  V(H)  and  V'(H)  are  primarily  designed  for  the 
one-sided  situations  as  illustrated  on  page  41.  Several  de¬ 
tectors  for  other  types  of  situations  will  be  discussed  after 
the  numerical  example. 

3.  In  the  one-sample  case,  the  corresponding  rank-sum  de¬ 
tectors  are  based  on  statistics  of  the  form 

K(H)=^  £  H-1  [F0(Xi)]  • 

i=l 

Since  F0  ( Jj)  is  the  percentile  rank  of  Xt,  one  might  be  led 
to  consider  statistics  of  the  form 

n 

i=l 

This  statistic  is  not  well  defined,  since  H~l  is  only  defined 
for  numbers  between  0  and  1.  The  next  step  in  the  evolu¬ 
tion  is  the  detector  based  on 

n 

1  V 

n  ^  H  N 

1=1  L 

which  would  be  well  defined  except  for  the  fact  that 
H  "(1)  =  00 ,  if  H  is  a  strictly  increasing  continuous  cpf.  The 
final  result  is,  then,  the  given  statistic  V(H). 


4.  For  the  case  H  =  U0,  t/0_1(u)  =  u  whenever  0  <  u  <  1  and, 
hence, 

n  n 

1 


HUo)  =- 


l 

1=1 


Rtfi)  ; _ 

N+l  n{N+ 1) 


2 

i  =1 


R(*i) 


(57) 
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which  is  equivalent  to  the  more  usual 

n  n 

\  ^  R(Xi)  or  J  R(Xi)  , 
i= 1  i=l 

the  Mann-Whitney-Wilcoxon  statistic.  Further,  V(U0)  = 

V’(Uo). 

Finally,  to  facilitate  the  computations  in  the  example  below, 
one  needs  to  know  the  following  theorem  (see  ref.  8,  p.  500): 


Theorem  7.  If  F0=  F\ ,  i.  e. ,  if  the  PS  data  are  pure  noise, 

then 

n 

(i)  the  statistic  Vh  =  —  ^  h[R(Xi)]  has 

i-l 

N 

mean  E(Vh)  ^  h(r) 


and  variance 


' 

■■ 

TH 

var  ( Vh )  - 

;  nN2 

(JL\  ^  2  1 

\N-l)  X  h{r)  iV-1 

M  ^ 

_ ^ _ i 

2 

p=  i 

Lr=l  J 

j 

and 


(ii)  the  statistics 


V(H)  - 

E  [7(1/)] 

and 

V  '(H)  ~  E  [V  '(H)] 

y/var  [P  (i7)] 

yvar  [V(H)  ] 

have  asymptotically  a  normal  distribution  for  =  *.  u0, 
and  He. 


Example  11.  Consider  the  situation  for  which  n  =  10, 
m  =  8,  N  =  18,  PFA  a  =  .01  and  the  data  are  as  given. 

PS  data  (Jj):  18.1,  14.3,  15.2,  14.5,  17.9,  17.5,  15.3,  16.5, 
17.1,  16.6 

Pure-noise  data  ( YA:  14.2,  15.4,  16.4,  17.7,  18.2,  17.3,  15.1, 

16.1 


Using  normal  approximations,  one  derives  the  decision 
rules  below  for  the  detectors  described  above  and  for  the  one-sided 
alternatives  illustrated  on  page  41. 


V(UQ)  detector: 

Decide  YES  iff  ~  9‘ 5  <  -2.326. 

1.407 


7(4*)  detector: 

F(4>)  -  0 

Decide  YES  iff  v  ’  <  -2.326. 

•  ZttJ  Li 


V  (He)  detector: 

Decide  YES  iff  ~  °m  873  <  -2.326. 

.  254 


V'(He)  detector: 

Decide  YES  iff  <  -2.326. 


F'(4?)  detector: 

F'(4>)  -  0 

Decide  YES  iff  v  ’ -  <  -2.326. 

.  243 


The  ranked  data  and  computations  are  best  given  in  tabular 


form. 


r 

Z(r) 

Sample 

(Mann-Whitney 
-Wilcoxon) 
(.V+l)  V(U0) 
hi  (r)  =  r 

(Van  de 
Waerden) 
7(4-) 

h*r)=*~\7k) 

(Fisher- 

Yates) 

V'  (4-) 

h3(r)=E  [S(r)  |  4>] 

V(He) 
h4(r)  = 

ln(N-  1)  -  ln(N+l-r) 

V'  ( He ) 
hs(r)  = 

N 

Ivj 

j=N-r+ 1 

i 

14.2 

r 

1 

-1.62 

-1.82 

.054 

.056 

2 

14.3 

X 

2 

-1.25 

-1.35 

.111 

.115 

3 

14.5 

X 

3 

-1.00 

-1.07 

.171 

.177 

4 

15.1 

r 

4 

-0.80 

-0.85 

.236 

.244 

5 

15.2 

X 

5 

-0.63 

-0.67 

.305 

6 

15.3 

X 

6 

-0.48 

-0.50 

.379 

.392 

7 

15.4 

D 

7 

-0.34 

-0.35 

.459 

.476 

8 

16.1 

r 

8 

-0.20 

-0.21 

.546 

.566 

9 

16.4 

r 

9 

-0.07 

-0.07 

.641 

.666 

10 

16.5 

X 

10 

0.07 

0.07 

.747 

.777 

11 

16.6 

X 

11 

0.20 

0.21 

.864 

.902 

12 

17.1 

Q 

12 

0.34 

0.35 

.998 

13 

17.3 

7 

13 

0.48 

1.152 

1.212 

14 

17.5 

X 

14 

0.63 

0.61 

1.335 

1.412 

15 

7 

15 

1 

0.85 

1.558 

1.662 

16 

mm 

□ 

16 

1  - 

1.845 

17 

i 

Q 

1.25 

2.251 

18 

E 

1.62 

I  ~ 

2.944 

mr) 

■ 

■ 

0 

16.60 

2 h2(r) 

■ 

S 

13.23 

15.75 

26.23 

32.50 

E(Vh) 

■ 

1 

9.5 

0 

0 

.922 

var(  Vh) 

1.979 

.0643 

.0286 

.0592 

\J var  (Vh) 

1.407 

.232 

.254 

.169 

.243 

9.6 

.013 

.013 

.901 

.962 

Vh  -  E(Vh) 

.071 

.056 

.051 

-.124 

1 

-.156 

\/var  (7,,) 

Therefore,  each  of  the  detectors  decides  NO. 
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One  should  note  here  the  following  useful  lemma. 

Lemma.  If  U  is  a  random  variable  with  cpf  UQ,  then  for  each 
strictly  increasing  continuous  cpf  H 

is  a  random  variable  with  cpf  R\  and 

j  U0\  =  E[g(w )  |#]  whenever  such  expectations 

exist. 

Consequently,  one  can  rewrite  the  statistics 

n 

V'<H)  = \  J  E{w[R(Xi)]\H} 

i  =1 

as 

n 

V'(H)=^  ^  E(H~1{u[H(Xi)']  |  U0  }•  (58) 

i=l 

Recent  results13,14indicate  that  the  statistics  V'(H)  are  special 
cases  of  the  more  general  class  of  "locally  most  powerful"  rank 
statistics.  They  are  of  the  form 

n 

v{h>6)  ~~  ^  E  (j0  '{f/[i?(/i)]})  (59> 

i=l 

where 

U (i)  is  the  ith  order  statistic  of  a  sample  of  size  N  from  U0\ 
J0'(u)  =  In  h0'(u) 

0=6  o 

{  h6]  is  a  family  of  cpf’s  on  the  unit  interval  (0, 1)  such  that 
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is  non-zero  for  all  u  and  9  . 


dh0 

h0o(u)  =  u,  and  h0'  =  — 

[Model  II  detectors  based  on  these  statistics  correspond  to  the 
Model  I  detectors  discussed  immediately  preceding  Example  7 
(p.  45) ]  . 


Special  examples  of  these  statistics  are 


H 

^  I  £  (<F1{£/|>(Xi>]})2|  tfo] 


and 


1 

n 


1 

n 


£=  1 

n 

1 

i=  1 

n 

1 

i=  1 


E 


Hc~} 


Uo 


1  + 


Uo 


Hc  is  the  cpf  with  derivative  [tt(1  +^2)]-1 


(60) 


(61) 


(62) 


The  figure  on  page  76  illustrates  (A)  detectors  based  on 
equation  62,  primarily  designed  for  "location"  alternatives,  and 
(B)  detectors  based  on  equations  60  and  61,  used  principally  for 
"spread"  alternatives.  In  each  of  these  cases  there  are  difficulties 
in  computing  and  tabulating  the  expectations  E\W(r)  I  H ]  ,  as  well 
as  in  attaining  predetermined  PFA  a's.  To  prevent  these  diffi¬ 
culties,  one  should  consider  the  detectors  which  make  use  of  arti¬ 
ficial  noise  generators,  as  discussed  and  diagrammed  in  the 
following  section. 


D.  MODEL  II  ARTIFICIAL  NOISE  DETECTORS 


In  attempting  to  obtain  the  desired  preassigned  PFA  and  to 
tabulate  the  distributions  involved,  one  should  consider  an  adapta¬ 
tion  of  a  statistical  result  reported  by  Doksum.11 


A.  Overview. 

PS  DATA 

'"•Xn - J  DETECTOR  ' 


KNOWN 

ARTIFICIAL 

PUR  E-NOISE 

NOISE  DATA 

DATA 

WITH 

DESIRED 

J  1  ’  ’■*  ffl 

CPF  H 

B.  Detailed  sketch. 


PS  DATA 


CONVERTED  DATA 


Detector  employing  artificial  noise. 
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The  mathematical  basis  of  the  method  to  be  described  is 
theorem  8,  which  is  proved  in  reference  11. 

Theorem  8 ■  If  Zv  —  ,  ZN  and  ¥lt  •  •  * ,  WN  are  independent 
random  samples  with  common  continuous  cpf’s  G  and  H ,  respec¬ 
tively,  then  W[R(Zl)~\  ,  —  ,  W  [R(ZnY\  (i)  has  the  statistical  distri¬ 
bution  of  a  random  sample  with  common  cpf  H,  and 
(ii)  R{w[R(Zi)]}  =R(Zi)  for  i  «  1,  2,  •  •  •  ,  N. 

The  result  of  theorem  8  is  that  on  replacing  each  Z-value 
with  the  !/-value  of  the  same  rank,  one  is  able  to  replace  a  random 
sample  with  cpf  G  by  a  random  sample  with  cpf  H,  while  preserv¬ 
ing  the  ranks  on  which  the  Model  II  detectors  will  base  their  YES- 
NO  decisions. 

Since  the  Model  II  detectors  under  consideration  are  rank 
detectors,  and  since  the  described  transformation  preserves  the 
ranks,  there  is  no  loss  of  information,  but  a  gain  of  tractability, 
convenience,  and  economy  in  using  the  transformation  with  a 
judicious  choice  of  the  V  cpf  H . 

As  just  illustrated, 

1.  the  combined  sample  Xv  ■  •  • ,  Xn  Y1 ,  •  - " ,  Ym  is  ranked; 

2.  a  random  sample  W\,  ¥2,  •  ••  ,  WN  of  artificial  noise  with 

common  cpf  H  is  generated; 

3.  each  combined  sample  value  Xi  or  Y-  is  replaced  by  the 

¥  -value  ¥[R(Xi)~\  or  ¥[R(Yj)]  which  has  the  same  rank; 

4.  an  appropriate  statistic  is  evaluated  with  the  W  inputs 

¥ ( Xr) ;  W(Y{),-’-,  ¥(Ym);  and 

5.  on  the  basis  of  this  computed  value,  the  detector  decides 

YES  or  NO. 

(To  make  use  of  such  a  detection  procedure,  one  needs,  in 
addition  to  the  equipment  used  by  previously  described  detector 
models,  a  random  noise  generator  capable  of  generating  a  ¥ 
sample  having  the  desired  cpf  H. ) 


The  major  question  at  this  point  is:  Which  statistics  are 
appropriate  for  the  transformed  data? 


To  answer  this  question  one  first  considers  statistics  of  the 

form 


V'(H)  =~ 
n 


2 


i- 1 


E{w[R(Xi)]  | H) 


Since  for  fixed  n,  m  ,  and  hence  for  N ,  the  N  expected  values 
E[W(1)  |  #],•••,  E[W(N)  j  H]  are  fixed,  the  set  of  values 
Elwlmxj]  I  H},---,  E{V[R(Xn)]  I  H\  uniquely  determines  and 
is  determined  by  the  E{w\R  (Tj)]  j  H\  ,  •  •  •  ,  E\w[R  (¥m  )]  |  i? }  . 
Therefore,  the  statistic  V'(H)  is  equivalent  to  the  statistic 


n 

V"(H)  “i-  2  ElWlRiXt)]  |  H\ 
i=l 


1 

m 
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S  E{w[R(Yj)]  \h\ 


For  non-artificial  noise  detectors  it  is  more  convenient  to 
work  with  V'(H)  than  with  V"(H),  since  the  former  statistic  requires 
less  calculation  and  is  equivalent  to  the  latter. 

For  artificial  noise  detectors,  it  is  proposed  that  the  ex¬ 
pectation  signs  "E"  be  dropped  and  that  one  consider  statistics  of 
the  form 


and 


(63) 


T(H) 


Tl  TTi 

l  2  tfttftfi)]  2  W[R(Yf)] 

n  i= 1  m  J=  1 


(64) 


where  ¥lt  ■  ■  ■ ,  is  a  random  sample  of  artificial  noise  generated 
with  common  cpf  H.  Since  the  values  f'/(  1 ) ,  •  •  • ,  ¥(N)  are  not  fixed 
as  are  the  E  [&'(1)  j  H  ]  ,  •  •  •  ,  E[W {N)  \  E  ]  ,  the  statistics  T(H)  and 
T(H)  are  not  equivalent  as  are  Y'{E)  and  V"(H).  It  is  clear  that 


T(H)  does  not  make  use  of  all  the  randomly  transformed  data;  and  it 
has  been  proved11  that  under  some  usual  (translation)  alternatives 
and  for  large  sample  sizes,  detectors  based  on  T(H)  have  approxi- 

n 

mately  the  fraction  —  of  the  PFD  /3  of  detectors  based  on  T(H).  For 

that  reason  attention  will  be  restricted  to  detectors  based  on  the 
statistics  T(H). 

Before  illustrating  the  technique  outlined  above  it  is  neces¬ 
sary  to  state  the  following  theorem,  which  will  be  employed  in  the 
example. 


Theorem  9.  If  the  pure-noise  and  PS-data  cpf's  are  equal, 
and  ¥i ,  •  •  • ,  %  is  a  random  sample  generated  by  the  artificial 
noise  generator  with  cpf  H ,  then  T(H)  has  the  distribution  of  the 
difference  of  means  of  independent  random  samples  of  size  n  and 
m,  respectively  from  populations  with  cpf's  equal  to  H.  In 
particular, 

(i)  if  H=  $,>Jnm/N  T(<i>)  has  cpf  <t>; 

(ii)  if  H=  U0,  and  n  =  m.  =  N/2,  then  .  5  [l  +  T(U0)]  has  the  cpf 
of  the  mean  of  a  random  sample  of  size  N  from  U0,  i.  e. , 
its  density  is 

where  the  summation  is  over  r  ^  Nx\ 

(iii)  if  H  =  He,  and  n  =  m  =  N/2,  then  N  T{He )  has  the  cpf  of  the 
difference  of  two  independent  chi-square  variables,  each 
with  //degrees  of  freedom;  and 

(iv)  v/ 3 N  T(U0)  and  \fn/ 2  T(He)  are  asymptotically  N( 0, 1)  when 

n  =  m  -  N / 2 . 

It  should  be  remarked  that  with  reference  to  (iv)  of  Theorem 
9,  the  normal  approximation  is  "adequate"  for  n  =  m  =  N/2  ^  10. 


Example  12.  Consider  the  situation  for  which  n  -  m  =  10, 
N  =  20,  PFAo!  =  .  05  and  the  data  are  as  given. 

PS  data  (Xi):  18.1,  14.3,  15.2,  14.5,  17.9,  15.3,  16.5,  17.1, 
16.6,  18.4 

Pur  e-noise  data  (7j ):  18.3,  17.5,  14.2,  15.4,  16.4,  17.7,  18.2, 

17.3,  15.1,  16.1 


Using  results  (i)  and  (iv)  of  theorem  9,  one  derives  the  fol¬ 
lowing  decision  rules  for  the  case  ( FQ  >  )  illustrated  on  page  41. 

detector:  Decide  YES  iff  ' Jnm/N  T(ty  =  2. 236  !F(4>)  <  -1.  645 

T(U0)  detector:  Decide  YES  iff  \f M  T(U0)  -  7.746  T(U0)  <  -1.645 

T(He)  detector:  Decide  YES  iff  \!n  / 2  T(He)  =  2.  236  T(He)  <  -1.645 


The  detectors  above  need  artificial  noise  samples  generated 
from  (J0  and  He  distributions,  respectively.  One  such  collec¬ 
tion  of  samples  is  as  follows. 

<P  artificial  noise  sample:  2.455,  -0.531,  -0.634,  1.279,  0.046, 
-0.525,  0.007,  -0.162,  -1.618,  0.378,  -0.057,  1.356, 
-0.918,  0.012,  -0.911,  1.237,  -1.384,  -0.959,  0.731, 

0.  717 


Uo  artificial  noise  sample:  .950,  .455,  .317,  .869,  .358,  .853, 
.540,  .985,  .266,  .373,  .920,  .164,  .998,  .073,  .495, 
.496,  .641,  .417,  .906,  .903 


He  artificial  noise  sample:  .623,  .272,  .511,  .294,  .257,  .639, 
.818,  .165,  .110,  .691,  .606,  .317,  .219,  .428,  .223, 
.275,  .051,  .544,  .492,  .466 
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Table  13 


r 

X(r) 

Sample 

i 

Y 

14.2 

-1.618 

.073 

.051 

2 

X 

14.3 

-1.384 

.164 

.110 

3 

X 

14.5 

-0.959 

.266 

.165 

4 

Y 

15.1 

-0.918 

.317 

5 

X 

15.2 

-0.911 

.358 

.223 

6 

X 

15.3 

-0.634 

.373 

.257 

7 

Y 

15.4 

-0.531 

.417 

.272 

8 

Y 

16.1 

-0.525 

.455 

.275 

9 

Y 

16.4 

-0.162 

.495 

.294 

10 

X 

16.5 

.496 

11 

X 

16.6 

.540 

12 

X 

17.1 

0.012 

.641 

13 

Y 

17.3 

0.046 

.853 

14 

Y 

0.378 

.869 

15 

Y 

17.7 

0.717 

.903 

| 

16 

X 

17.9 

0.731 

.906 

17 

X 

18.1 

1.237 

.920 

18 

Y 

18.2 

1.279 

.950 

.639 

19 

Y 

18.3 

1.356 

.985 

.691 

20 

X 

18.4 

2.455 

.998 

N 

1  ¥l 

1=1 

.519 

11.979 

n 

i 

.497 

5.662 

771 

l 

.022 

6.317 

3.988 

T{H) 

.0475 

-.0655 

On  the  basis  of  these  computations,  all  three  of  these  detectors  decide  NO. 
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For  generating  artificial  noise  with  a  given  strictly  in¬ 
creasing  continuous  cpf  H,  it  is  often  most  convenient  to  make  use 
of  the  lemma  on  page  88.  One  first  generates  uniform  random 
numbers,  U x  ,  •  •  • ,  t/y  ,  i.e. ,  random  numbers  with  cpf  U0\  and  then 
transforms  them  using  ff-1to  obtain  the  sample  •  •  • ,  H~l  (UN) 

which  is  statistically  equivalent  to  artificial  noise  fror  ;  an 
"^-generator.  " 


Uo  GENERATOR 
(OR  RANDOM 
DIGIT  GENERATOR) 

U2  •  *  *  *  %U jf 

H'1 

CONVERTER 

\wt  =H~1(Ui)fori  =!,•••,  n] 

Artificial  H— noise  generator. 


From  these  considerations  one  sees  that  the  statistics 


T(H) 


I 


W\R(Yj )] 


can  be  rewritten  as 


n  m 

T(H)^  J  S  tf-1  MW/)]} 


(65) 


Corresponding  to  the  results  mentioned  in  the  latter  parts 
of  sections  II B  and  fflC,  it  is  found12  that  detectors  based  on  the 
statistics  T(H)  are  special  cases  of  detectors  based  on  statistics 
of  the  following  forms: 


T(h,e  ) 


1 

n 


I 


Jo 


j0  jtf[ft<77)]}  ; 

(66) 
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and 


n 

I  re  l,„0 

i  =1 

Til 

-h  2  re  l„So  <67> 

J  ^ 

where 

JG{u)~  In  <V(u); 

U(  1)  <  •  •  •  <  U(N)  are  the  order  statistics  of  a  random 
sample  from  U0;  and 

|  h0  }  is  a  family  of  cpf's  on  the  unit  interval  such  that 

dh0 

hn'  -  -  exists  and  is  positive  on  the  unit  interval,  and 

du 

he  {u)  -  u. 

°o 

The  most  common  versions  of  these  statistics  are  with 


0-0q 


F'  ( F~Hu )) 


and  with 


d_ 

do 


J/"  (F-:(u))]  F-\u) 
F'iF^iu)) 


where  F  is  an  arbitrary  strictly  increasing  cpf  with  two  derivatives. 
The  first  of  these  two  types  is  primarily  for  "location"  alternatives, 
and  the  detectors  based  on  statistics  of  the  second  type  are  for 
"spread"  alternatives. 
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E.  GOODNESS  CRITERIA  FOR  MODEL  II  DETECTORS 


As  was  the  case  for  Model  I  detectors,  all  goodness  criteria 
will  be  based  on  PFD  /3  (for  fixed  PFA  a  ),  and  since  these  Model  II 
detectors  are  SDF,  the  PFD  will  be  a  function  solely  of  F0  ■ 
Further,  because  of  the  multitude  of  possible  alternatives  it  is  physi¬ 
cally  impossible  to  make  a  "reasonably"  complete  tabulation  of 
PFD.  Consequently,  one  is  led  to  a  comparison  of  detectors  based 
on  (1)  max-min  power  for  one-sided  bands;  (2)  PFD  for  Rayleigh 
alternatives;  (3)  ARE;  and  (4)  LM  PFD  (locally  minimum  PFD)  and 
ALM  PFD  (asymptotically  LM  PFD). 


1.  Minimum  PFD  for  One-sided  Bands 

The  results  here  are  adaptations  of  the  statistical  work  of 
Bell,  Moser,  and  Thompson,10  which  is  a  partial  extension  of  the 
statistical  work  of  Chapman.9  One  first  considers,  for  each 
strictly  increasing  continuous  pure-noise  cpf  F0,  the  family 
Q  (F0,  A)  of  noise-plus-signal  cpf's  Jq  satisfying  (1)  Fx  <  F0  and 

(2)  max  [.F0(a:)  -.^(ar)]  =  A.  (An  illustration  is  given  on  p.  47.) 

x 

In  order  to  apply  the  max-min  PFD  criteria  one  must  compute  both 
the  maximum  and  the  minimum  PFD  for  F0  vs.  noise-plus-signal 
cpf's  in  the  one-sided-band  family  0(Fo,  A).  Unfortunately, 

Bell,  Moser,  and  Thompson10  were  unable  to  derive  a  reasonable 
computation  procedure  for  maximum  PFD.  The  only  results  avail¬ 
able  are  those  for  minimum  PFD,  as  given  in  the  graphs  on  pages  99 
111.  Here  the  PFA  =  . 01  or  .  05;  j3  is  plotted  vs.  A;  and  the  sample 
sizes  range  from  n  =  4,  m  =  5  to  n  =  10,  ra  =  10.  These  graphs 
are  based  exactly  on  the  complete  set  of  computations  of  Bell, 

Moser,  and  Thompson,  with  the  same  cases  omitted.  Those 
labeled  A  through  L  represent  the  results  of  the  use  of  accurate 
approximation  formulas,  while  the  curves  in  the  M  graph  are  the 
result  of  Monte  Carlo  calculations  for  T($)  with  1000  trials  per 
point.  It  is  felt  that  since  other  Monte  Carlo  computations  in  this 
study  are  based  on  10,000  trials  per  point,  the  results  for  T($) 
are  somewhat  less  accurate  than  the  other  Monte  Carlo  computations 
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Minimum  PFD  for  one-sided  noise.  (Continued) 


A 


Minimum  PFD  for  one-sided  noise 


or  one 


On  the  basis  of  the  Bell-Moser-Thompson  calculations  for 
the  Fisher-Yates,  Epstein-Rosenbaum,  Savage,  Van  de  Waerden, 
Mann  Whitney- Wilcoxon  and  Cramer-von  Mises  Model  II  detectors, 
one  concludes: 

a.  On  the  basis  of  minimum  PFD, 

F’($)|  7(4>)}  V(U0)  =  F'(0o)}  7'(ffe)}}  L'~W2(n>m) 

where 

very  slightly  better  than 
slightly  better  than 
better  than 
}  |  =  much  better  than 
~  =  approximately  equal 

b.  The  max-min  criterion  cannot  be  applied  here  until  com¬ 
parable  maximum  PFD  calculations  are  made, 

2.  PFD  for  Rayleigh  Alternatives 

One  next  considers  comparison  of  the  Model  n  detectors  on 
the  basis  of  Rayleigh  alternatives,  which  are  considered  quite  im¬ 
portant  in  signal  detection  (see  illustration,  p.  53).  Because  of 
time  limitations,  computations  were  made  only  for  D(n,m), 

D~(n  ,m),  W2(n,  m),  and  V(UQ);  for  6  =  2  and  4;  and  for  sample  size 
combinations  (5,  5),  (5,  10),  (10,  5),  and  (10,  10).  For  computa¬ 
tional  convenience  the  curves  are  plotted  in  the  form  &  vs.  /?  for 
the  cases  mentioned  in  the  following  figures  (p.  113-116).  On  the 
basis  of  these  computations  one  concludes: 

a.  For  Rayleigh  alternatives 

7(U0)  }zT(n,ui)ff  W2(n,  m  )  }  Din,  m) 

b.  If  one  wishes  to  use  a  Model  n  detector  which  (besides  its 
DF  property)  performs  well  against  Rayleigh  alternatives, 
one  should  choose  a  V(U0)  detector,  and  avoid  using  a 
D(n,m)  detector. 


}  - 

|  = 
}  = 
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PROBABILITY  OF  A  FALSE  DISMISSAL  j8 


PFD  vs.  PFA  for  Rayleigh  noise. 
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3.  Asymptotic  Relative  Efficiency  (ARE) 


The  ARE  results  for  Model  II  detectors  are  exactly  the 
same  as  those  for  the  corresponding  Model  I  detectors.  In  fact 
one  has  the  following  theorem  from  Bell  and  Doksum.11 


Theorem  10.  Let  H  be  a  strictly  increasing  continuous  cpf 
with  second  moment;  and  let  |<7y  |  be  a  family  of  noise-plus- 
signal  cpf's  satisfying  (a)  GN  ^  F0  and  (b)  lim  J dF0  =ias  N 
tends  to  infinity.  Then  one  has 


A\7'(H),  7(5)]  =  A[V'(H),  T(H)]  =A  [7(5),  T(H)]  =1 


for  Model  II  detectors;  and  in  comparing  Model  I  and  Model  II 
detectors, 


A[K(H),  K(V1)]=A  [7'(5),  7,(51)]  =  A[V(H),  7(5^]  =  A[T(H),  =1 


for  all  (sufficiently  regular)  H  and  H1.  Consequently,  one  has  for 
translation  alternatives  table  14,  which  is  an  exact  replica  of 
table  6.  From  this  table  one  concludes: 

a.  that  on  the  basis  of  known  ARE's  there  is  no  clear-cut 
ordering  of  the  Model  II  detectors;  but 

b.  that  if  one  wishes  good  asymptotic  performance  (in  the 
presence  of  increasingly  weak  signals)  for  logistic  type 
distributions  one  should  use  a  V (U0)  or  T(U0)  detector  and 
avoid  the  use  of  the  following  detectors: 


7(5C),  7(5C),  V'(He),  7'(5e), 


T(He),  T(Hc). 
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Table  14.  ARE's  for  Model  II  Detectors 


\ 

\ARE's 

TRANSLATIOlK 

ALTERNATIVES^^ 

^  •  - 

•  Eh 

£  :  ~ 

*  _  *  >& 

■  g 

ii  ii 

,3  j 

Eh  •  I 

'S-  -  •  h  1 

^  ^  *  X  1 
^  ^  ! 

E^ 

l^j  ^  ^ 

^  II  II 

! 

3  ^ 

1 - I  0) 

"3>  ^  5 

^  »§r  I  ^ 
>2^  •  *0^ 

Er-H 

^  II  II 

fc> 

'r' 

^,3  :  ^ 

'2  ti  Eh 

fc*  1~‘  1 — 1 

i — i  ^  ^ 

II  II 

I6:  .g 

t=^  ""5)  . 

.  tt3  •  ^ 

3r  :  - 

3-  ^  Eh 

^  L— 1  ‘z1 

1 — 1  ^  ^ 

^  II  II 

^  * 

^  -  ?ta 

^  3  ■ 

~  * 

3  L>  Eh 

s: i— ■  l-j 

1 — 1  ^  ^ 

-sac*  II  II 

Normal 

F0  {x)  =  $  ix) 

1.05 

Uniform 

F0(x)  =  U0{x) 

OO 

oo 

OO 

1.00 

Exponential 

F0  (x)  =  He(x) 

CO 

oo 

.333 

Negative  exponential 
F0(x)  =Ee(x) 

=  er,  0 

oo 

oo 

.333 

Logistic 

F0(x)  =  1  1  — 

1  +  e  x 

m 

1.27 

.75 

.75 

1.00 

Double  exponential 

r  .  _fe~/ 2.  x^O 

Fo  {X)~ 

1-eV  2,x  >0 

.85 

Cauchy 

1  1 

F0(x)  =2+n  ^  x 

.71 

• 

118 


4.  Almost  Locally  Minimum  (ALM)  PFD  Model  II  Detectors 


Using  the  randomized  statistics  of  Section  HID  and  the 
following  theorem  of  Bell  and  Doksum12,  one  can  construct  ALM 
PFD  detectors  for  certain  specified  noise  classes. 

Theorem  11.  If  conditions  (i),  (ii),  (iii)  and  one  of  (iva), 

(ivb)  and  (ivc)  below  hold  for  strictly  increasing  continuous  pure- 
noise  cpf  F0  and  the  noise-plus-signal  class  j  Gg  =he  '  (F0) } ,  then 
a  Model  II  detector  which  bases  its  YES-NO  decisions  on  the 
statistic 

n  m 

T(h,e)  =  l  2  Jo  |^(^)][  S 

i=l  ' '  j= 1 

or  on  the  statistic 

n  m 

2”(h,0o)=-  J  Jo1  {tf  [*<**)]} 

n  i=1  m  J=1 

g 

(where  Jg(u)  =  In  h0'(u)\  J0'{u)  =  —  In  he'(u)  | 

and  U(  1),  •  •  •  ,  U (N)  are  the  order  statistics  of  a  [70-random  noise 
generator)  is  ALM  PFD. 

Regularity  Conditions: 

(i)  he  has  a  derivative  h0',  which,  for  almost  all  u,  is  con¬ 
tinuous  in  some  nondegenerate  closed  interval  I(0O)  contain¬ 
ing  0O ,  and  satisfies  hg'{u)  =  1; 

(ii)  thei-e  exist  functions  M0(u)  and  Mi(u)  integrable  over  (C,  1) 
and  such  that  for  all  u  and  all  6  inl(0o), 


0  <  hg(u)  —M0(u)  and  |hp(u)|  -lfi(u); 
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0 


n 

(iii)  0  <  lim  —  =  r  <  00  as  N  tends  to  infinity; 


(iva)  J0 '  is  the  inverse  of  a  cpf  H  with  finite  second  moment; 

1 

(ivb)  Jo'  is  convex,  J  D^o'(u)]  du  <  °°  and  J'0  is  bounded 


0 


below;  and 


(ivc)  there  exists  a  continuous  cpf  H  such  that  Jo'[^(^)]  is 


convex,  bounded  from  below,  |  J~ xdH(x) 

J  J0'  [ H(x )]  'JH(x)  < 


<  °°,  and 


Table  15  gives  the  Je  and  J0'  for  detectors  which  are  ALM 
PFD  with  respect  to  some  special  classes  of  noise-plus -signal 
cpf's.  Since  the  detectors  in  question  are  SDF,  the  functions  J0 
and  J0'  depend  only  on  he=  Fe  F0-1  . 
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Table  15.  ALM  PFD  Deteetors  for  Certain  Noise-plus-signal  Classes  |iv>(^)}  of  epf 


Table  16.  Statistical  Distributions  for  Model  II  Detectors 


Statistic 

Distribution 

Source  of  Tables,  For¬ 
mulas,  etc.  (numbers 
refer  to  Bibliography) 

D(n,m) 

(2 -sample  Kolmogorov- 
Smirnov) 

21,  p,  434  ff 

D~{n,m),  D+(n,  m) 

(2-sample,  1-sided 
Kolmogorov -Smirnov) 

21,  p.  431  ff 

W2(n,  m) 

(2-sample  Crame’r-von 
Mises) 

21,  p.  443  ff;  23 

W+(n,m), 

W~(n>  n) 

[Equivalent  to  7([/0)] 

Qi 

Asymptotically  chi- 
square  (median) 

21,  p.  49  ff;  28,  p. 

226  ff 

Qs+ 1 

Asymptotically  chi- 
square  (extended  median) 

21,  p.  49  ff;  28,  p. 

296  ff 

Qs+i 

Asymptotically  chi- 
square  (Matusita) 

21,  p.  49  ff;  25 

e 

(Empty  block) 

29,  p.  446 

S(n,  m) 

(2 -sample  Sherman) 

S2(n,  m) 

(2-sample  Kimball- 
Moran) 

L » 

(Epste  in-Rosenbaum) 

21,  499  ff 

L 

(Moses  "extreme 
reaction") 

30 

V{Uo),  V"{Uo) 

(Mann-Whitney-Wilcoxon) 

22,  p.  325  ff 

v n  m 

Asymptotically  normal 
(Fisher-Yates) 

8,  p.  498;  31,  p.  66 

7(<f.) 

Asymptotically  normal 
(Van  de  Waerden) 

32 

T  (<1>) 

Normal  (Doksum) 

11,  12 
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IV.  MODEL  III  DETECTORS 


As  mentioned  in  Section  I,  the  Model  in  detectors  are  ex¬ 
tensions  of  the  Model  II  detectors.  The  region  being  searched  or 
examined  is  divided  into  k  subsectors  and  the  PS  data  from  the  k 
subsectors  are  compared.  The  detector,  then,  on  the  basis  of  the 
appropriate  ^-sample  statistic,  decides  NO  iff  there  is  no  signifi¬ 
cant  difference  between  the  k  PS  data  samples: 

Sector  1:  XU,X12,-",  x\,nl 

Sector  2:  X21  ,  X22  » ’  "  »  x2 ,n2 


Sector  k  :  -^jci  >  xk2>  >  Xk ,n^ 
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The  three  "natural"  subclasses  of  Model  III  detectors  are: 

(1)  Sample  cpf  (SDF  Scheffe  Model  III)  detectors; 

(2)  Run-block  (SDF  Scheffe  Model  III)  detectors;  and 

(3)  Rank-sum  (SDF  Scheffe  Model  III)  detectors. 

These  three  subclasses  are  introduced  and  illustrated  in 
the  next  section,  where  the  following  notation  will  be  employed 

m  -  the  number  of  data  points  in  the  ith  PS  data  sample; 

k 

k  =  number  of  sectors;  and  N  =  ^  . 

i  =1 

PS  sample  from  sector  i'-  1>  %i 2>  ■  '  ■  >  ^i,ni 

Order  statistics:  Xj(l)  <  Xj(2)  <•  •  •  < 

Combined  sample:  \zlt  •  •  • ,  ZN  }  =  j Xn  ,  •  •  • ,  J1>ni  ;•  •  • ; 

**.„k  | 

Order  statistics:  Z  (1)  <  •  •  •  <  Z(N) 

Degenerate  cpf  e,  where  e(x)  =  0  or  1  according  as 
x  <  0  or  i^O. 

Sample  cpf  of  ith  sample;  Fi  ,  where 
Fi(x)=-  Y  e(x-Xif) 

1  r  1 

Combined  sample  cpf:  F ,  where 

N 

^(;  )  =~  ^  ~  7jr  > 
r=  1 
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Rank  of  X*7-  in  combined  sample:  R(Xij) 


Ri+  1  R(Xi  ),  the  sum  of  the  ranks  of  the  members  of 
3= 1 

the  ith  sample 

Artificial  noise:  ,  •  •  • ,  WN 

Order  statistics:  ¥(1)  <  •  •  •  <  W(N) 


Wu  =  I  W  [f?  (Xu)]  ,  the  sum  of  the  artificial  noise 

j=l 

values  having  the  same  ranks  as  the  members  of  the  ith 
sample 

h  Tli 

¥++  =  2  2  vtRixij  )]  ,  the  sum  of  all  of  the  artificial 

<=1  3  =  1 
noise  values 


H:  the  cpf  of  the  artificial  noise  sample 

nt 

EU(H)=  X  E\w[R(Xij)]  I  H\ 

3=  1 

h  m 

E++{H)  =  I  I  |  H  | 

i=l  j=  1 
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For  some  of  the  detectors  to  be  described  it  will  be  neces 
sary  to  consider  blocks  of  the  order  statistics  Z(  1)  <  •  •  •  <  Z(N). 
The  blocks  are  chosen  to  be  of  equal  size  (if  possible),  and,  conse 
quently  (wherever  possible), 

b  ,  the  number  of  blocks,  is  a  positive  integer  which 
divides  N  and 


The  b  blocks  of  order  statistics  are,  then, 


n-ij  -  number  of  members  of  the  ith  sample  in  the  jth 
block 
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Further,  it  will  be  helpful  to  recall  some  of  the  more  use¬ 
ful  relations  between  these  quantities 

N 

R(Xij)  =  2  e  ~  zr)  -  N  FUii) 

r  =1 

k 

X(z)  2  ni  F  ^r))  = 

1=1 

k  nt  k 

2  J  R(Xij)'*\w+  1)=  X  Ri  + 

i=  1  .7=1  i=l 


y++2 


5  fc 

2  ni.i=ni>  2  nij=N/b 

7=1  i=l 

One  can  now  introduce  and  illustrate  the  model  III  detectors. 


A.  MODEL  III  SAMPLE  Cpf  DETECTORS 

The  sample  cpf  detectors  are  those  based  on  the  given 
statistics. 
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1.  Kolmogorov-Smirnov  Detector 


D(nlt  nk  ;  s;  *) 

k 

=  max  7  j  \Fi  (Z(r))  -F(Z(r)) 
r  i=l 

for  which  the  most  usual  special  case  is 


Jrii  *[>(Z(r))]  j  , 
(68) 


D(m,  •  •  * ,  nk  ;  2;  1)  =  max 

v 


k 

2  Hi  ?,2(Z<r))  -  £ 
i=  1 


(69) 


2.  Cramer-yon  Mises  Detector 


nk  ;  s  ;*) 

N  k 

=  ^  I  I  j  |ff  (Z(r))  -  7(Z(r))|  >/nt*|7(2<r))]j 
r=l  i=l  (70) 

for  which  the  usual  special  case  is 


N 


~(ni 


n. 


2;  1)  = 


N 


Fr{Z(r))  - 


r =1  i=l 


(N+l)  (2N+1) 

6  N 


(71) 


3.  Sign  Quantile  Detector 


Q(nv--,  nKiq1r'’tqmia,  *) 

m  +1  k 

=  2  2  [  |  -*i  -K  (Z«H))  -<v9t-i>l*it 

4=1  4=1  '  ,72) 

for  which  the  most  usual  version  is 


k  b 


Q(n 


l,  *•*,  ;  6)  =  b  J  ^ 


i=l  .7=1 


rii 


-  TV 


(73) 


B.  MODEL  III  RUN-BLOCK  DETECTORS 


Model  III  run-block  detectors  are  used  very  sparingly,  but 
the  most  usual  one  is  based  on  the  Mosteller-Tukey  statistic 

M,  length  of  last  run,  =  max  m  [1-J-j  (/t(nt))]  (74) 


C.  MODEL  III  RANK-SUM  DETECTORS 


Model  III  rank-sum  detectors  are  based  on  statistics  cor¬ 
responding  to  the  numerator  of  the  "F  -  statistic"  in  the  ordinary 
analysis  of  variance 
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Usually  one  chooses  H  =  U0  to  obtain  the  statistic  of  the 
Kruskal-Wallis  detector 


S(U0)  = 


12 


Y  Ru2 


N{N+1)  ^  nt 


-  3(N+1) 


(76) 


or  H  =  $  to  obtain  the  Model  III  Van  de  Waerder  detector 


k 


-  2 


-lx2 


(Ru) 


i- 1 


Hi 


(77) 


The  detectors  based  on  the  expected  values  E\W  (r)\  H~\  in¬ 
stead  of  the  percentiles  R~l(jj^j  are  those  whose  statistics  are 


of  the  form 


S'(H)  =  Z,  nt 
i= 1 


’Ei+(H)  E++  (H)  \2  V  (R)  R++2  (R) 


N 


-  2 

i=l 


N 


S'(U0)  =  S(U0),  and 

k 

i  =1 


(78) 

(79) 

(80) 


are  the  usual  special  cases  of  this  statistic. 


D.  MODEL  III  ARTIFICIAL  NOISE  DETECTORS 


The  final  Model  III  rank- sum  detectors  are  those  which 
make  use  of  artificial  noise  in  making  decisions.  (See  illustrations, 
pp.  90,  96).  The  statistics  employed  here  are  based  on  an  idea 
of  Doksum11  and  are  of  the  form 
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(81) 


where  H  is  the  epf  of  the  artificial  noise  random  sample  Wlt  •  ■  • ,  WN  . 

S (4>)  is  the  most  common  version  in  use.  (82) 

To  illustrate  the  computations  involved,  the  following 
example  is  presented. 

Example  13.  The  region  being  searched  is  divided  into  four 
subregions,  and  the  PFA  is  set  at  oi  =  .  05.  The  data  from  the  sec¬ 
tors  turn  out  to  be 

Sector  1:  4.7,  4.1,  4.5,  7.6,  6.9,  5.8 

Sector  2:  4.8,  4.4,  4.9,  5.7,  7.0 

Sector  3:  4.3,  5.4,  3.4,  4.6,  5.0 

Sector  4:  5.6,  5.3,  6.4,  6.0,  5.2,  6.5,  5.1.  4.2 

Consequently,  one  has 

fr=4,  ^=6,  n2= 5,  n3=5,  n4  =8,  and  N= 24. 

The  decision  rules  for  the  various  detectors  are  as  given. 

A  A 

M  detector:  Decide  YES  iff  M  —  4,  the  95th  percentile  of  the 
Mosteller-Tukey  table.33 

lt/-2(6,  5,  5,  8)  detector:  Decide  YES  iff  W2  >  1.  00,  the  95th  per¬ 
centile  of  the  Kiefer  table.34 

.P(6,  5,  5,  8)  detector:  Decide  YES  iff  D  >  3.  06,  the  95th  per¬ 
centile  of  the  Kiefer  table.33 

S(U0)  =  S' (U0)  detector:  Decide  YES  iff  S{U Q)  >  7.  81,  the  95th 

percentile  of  chi-square  distributions  with  3  degrees 
of  freedom. 
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S($)  detector:  Decide  YES  iff  >  7.  81. 


Q  (6,  5,  5,  8;  3)  detector:  Decide  YES  iff  Q  >  12.59,  the  95th  per¬ 
centile  of  the  chi-square  distribution  with  6  degrees 
of  freedom. 

Since  all  of  the  detectors  being  considered  base  their  de¬ 
cisions  solely  on  functions  of  the  ranks  B(  7),  the  data  may  be 
ranked  as  shown  in  table  17. 

The  last  column  of  the  table  contains  the  ordered  values  of 
the  data  produced  by  an  artificial  normal  noise  generator:  0.344, 
-0.664,  1.351,  -0.429,  -2.510,  -0.148,  -0.132,  -0.605,  0.379, 
0.394,  0.526,  -1.354,  1.119,  0.705,  -1.167,  0.256,  -0.517, 
-0.084,  1.553,  0.588,  0.336,  0.220,  -0.177 


From  the  table  one  computes  readily 


217.170 


-,542; 

6  (24) 


D  =  max  (.158,  . 200,  •••,  .123,  0.000)  =  1.583;  and 
M  =  1. 


Consequently,  the  decisions  are 
W2( 6,  5,  5,  8)  detector:  NO 
D( 6,  5,  5,  8)  detector:  NO 
M  detector:  NO 


In  order  to  compute  S(U0),  S'(<b)  and  Q(nlt  •  •  * ,  n4  ;  3)  it  is 
convenient  to  construct  tables  18  -  20. 


Table  18.  S{U0) 


Entries  are  the  ranks  R(Jij). 
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From  these  last  three  tables  one  computes,  respectively, 


S(U0)  = 


5(4)  = 


12 


(24)  (25) 


ml  +  +  mi  +  (ml 

6  5  5  8 


3(25)  =  3.432 


(1.  018)“  (.  845)“  (-3.  351)“  (1. 486)“ 


(.  002)“  _ 
24 


2. 838; 


and  <3(6,  5,  5,  8;  3)  = 
3 


32  +  02  4  32 


6 


9  9  9 

r  +  2“  +  2“ 

+  — - —  + 


32  +  22  4  02 


l2  4  42  4  32 


-  24  =  7.95 


These  detectors  then  decide  as  follows; 
S (U0)  detector;  NO 
5(4)  detector;  NO;  and 
§(6,  5,  5,  8;  3)  detector;  NO. 
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E.  GOODNESS  CRITERIA  FOR  MODEL  III  DETECTORS 


The  goodness  criteria  treated  for  Model  I  and  Model  II  de¬ 
tectors  are  primarily  related  to  one-sided  alternatives  as  illus¬ 
trated  previously  (see  page  41.)  Unfortunately,  when  there  are 
three  or  more  samples  to  be  compared,  the  term  "one-sided 
alternatives"  does  not  have  a  clear  meaning,  and  the  number  of 
reasonable  interpretations  is  excessively  large  to  tabulate.  Con¬ 
sequently,  no  attempt  will  be  made  to  apply  the  max-min  PFD 
criterion  or  that  for  PFD  with  Rayleigh  alternatives. 

12 

No  statistical  work  comparable  to  that  of  Bell  and  Doksum 
is  currently  available  for  adaptation  to  Model  III  detectors.  Hence, 
the  ALM  PFD  criterion  will  not  be  employed  here. 

However,  the  statistical  work  of  Puri  and  of  Bell  and 
Doksum11  can  be  adapted  to  give  ARE  results  for  some  of  the  rank- 
sum  Model  III  detectors.  These  results  are  replicas  of  tables  6 
and  14  with  the  appropriate  changes  of  statistics.  From  tables  21 
and  22  it  is  evident  that  the  following  adaptation  of  the  results  of 
Puri18  and  of  Bell  and  Doksum11  is  valid. 


Theorem  12.  Under  the  regularity  conditions  of  Puri18 
and  for  translation  alternatives,  one  has: 


(i)  1  [S(H),  =  aIs(H),  £(#)]  =A  ls'(H),  S{H)]  =1 

for  Model  in  detectors;  and,  in  comparing  corresponding 
Model  I,  II,  and  in  rank-sum  detectors, 


(ii) 


A  [ K(H ), 
A  lS{H), 


K(G)] 

T(G) 

*(<?>] 


=  A[V(H),  7(G)]  =A[V'{H),  7'(G)]  = 
=  .4  [S(H),  S  (G )]  =A  [S'(H),  £’(£)]  = 
,  for  all  (sufficiently  regular)  H  and  G. 
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Table  21.  ARE's  for  Model  III  Detectors 
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Table  22 .  Statistical  Distributions  for  Model  III  Detectors 


Statistic 

Distribution 

Soui'ce  of  Tables, 
Formulas ,  etc . 
(Numbers  refer  to 
Bibliography) 

-D(ni,  * '  ’  > n.\>  2,  1) 

(k -sample  Kolmogorov  - 
Smirnov) 

34 

W2(nl3  ■  ■  •  ,nk,  2,  1) 

(k -sample  Cramer- 
von  Mises) 

34 

Q(nv - ,nk,  b) 

(Extended  median) 

Asymptotically 

chi-square 

28,  p.  293; 

21,  p.  49  ff 

a 

M 

(Moste  lie  r-Tukey) 

33 

S(U0),  S'(U0) 

Asymptotic  chi-square 
(Kruskal-Wallis) 

21,  p.  49  ff;  19, 
p.  290  ff 

Asymptotic  chi-square 
(Van  de  Waerden) 

21,  p.  49  ff;  32 

S'  (<b) 

(Fisher-Yates) 
Asymptotic  chi-square 

21,  p.  49  ff; 

31,  p.  66  ff 

.?($) 

Chi-square 

21,  p.  49  ff;  11 

5 


REVERSE  SIDE  BLANK 
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